Glossary of Notation 


Non-alphabetic notation 


Lx] 


adjacency relation 
successor relation (digraph) 
isomorphism relation 
congruence relation 
implication 

floor of number 

ceiling of number 

tlyedey FE} 

absolute value of number 
size of set 

set description 

infinity 

empty set 

union 

intersection 

subset 

subgraph 

subgraph of G induced by S 
complement of graph or set 
(planar) dual 

kth power of graph 

set of k-tuples from § 

edge cut 

source-sink cut 

deletion of vertex 

deletion of edge 
contraction of edge 

disjoint union of graphs 
join of graphs 

cartesian product of graphs 
symmetric difference 
vertex duplication 

vertex multiplication 
cartesian product of sets 
difference of sets 

binomial coefficient 
multinomial coefficient 


n-vector with all entries 1 
conditional variable or event 
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Roman alphabet 


c(e) 

cap(S, T) 

iy 0045p 
d(v), dg(v) 
d*(v), d~(v) 
D 

D(G) 

d(u, v) 

diam G 
det A 

E(G) 

E(X) 

e(G) 

ftv), f7(S) 
S-(), FS) 


adjacency matrix 
adjugate matrix 
bandwidth 

bases of matroid 
circuits of matroid 
cycle with n vertices 
power of a cycle 
number of components 
circumference 
(Hamiltonian) closure 
cost or capacity 
capacity of a cut 
degree sequence 
degree of vertex 
out-degree, in-degree 
digraph 

distance sum 

distance from u to v 
diameter 

determinant 

edge set 

expected value 

size (number of edges) 
total exiting flow 

total entering flow 
function, flow 

number of faces 

graph (or digraph) 
random graph in Model A 
Harary graph 
independent sets of matroid 
identity matrix 
matrix of ali 1’s 
complete graph 
complete bipartite graph 
line graph 

lower bound on flow 
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Preface 


Graph theory is a delightful playground for the exploration of proof tech- 
niques in discrete mathematics, and its results have applications in many areas 
of the computing, social, and natural sciences. The design of this book permits 
usage in a one-semester introduction at the undergraduate or beginning grad- 
uate level, or in a patient two-semester introduction. No previous knowledge of 
graph theory is assumed. Many algorithms and applications are included, but 
the focus is on understanding the structure of graphs and the techniques used 
to analyze problems in graph theory. 

Many textbooks have been written about graph theory. Due to its em- 
phasis on both proofs and applications, the initial model for this book was 
the elegant text by J.A. Bondy and U.S.R. Murty, Graph Theory with Applica- 
tions (Macmillan/North-Holland [1976]). Graph theory is still young, and no 
consensus has emerged on how the introductory material should be presented. 
Selection and order of topics, choice of proofs, objectives, and underlying themes 
are matters of lively debate. Revising this book dozens of times has taught me 
the difficulty of these decisions. This book is my contribution to the debate. 


The Second Edition 
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The revision for the second edition emphasizes making the text easier for 
the students to learn from and easier for the instructor to teach from. There 
have not been great changes in the overall content of the book, but the presen- 
tation has been modified to make the material more accessible, especially in 
the early parts of the book. Some of the changes are discussed in more detail 
later in this preface; here I provide a brief summary. 


e Optional material within non-optional sections is now designated by (*); such 
material is not used later and can be skipped. Most of it is intended to be 
skipped in a one-semester course. When a subsection is marked “optional”, 
the entire subsection is optional, and hence no individual items are starred. 

e For less-experienced students, Appendix A has been added as a reference sum- 
mary of helpful material on sets, logical statements, induction, counting 
arguments, binomial coefficients, relations, and the pigeonhole principle. 


xi 
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e Many proofs have been reworded in more patient language with additional 
details, and more examples have been added. 

e More than 350 exercises have been added, mostly.easier exercises in Chapters 
1-7. There are now more than 1200 exercises. 

e More than 100 illustrations have been added; there are now more than 400. 
In illustrations showing several types of edges, the switch to bold and solid 
edges instead of solid and dashed edges has increased clarity. 

e Easier problems are now grouped at the beginning of each exercise section, 
usable as warm-ups. Statements of some exercises have been clarified. 

e In addition to hints accompanying the exercise statements, there is now an 
appendix of supplemental hints. 

e For easier access, terms being defined are in bold type, and the vast majority 
of them appear in Definition items. 

e For easier access, the glossary of notation has been placed on the inside covers. 

e Material involving Eulerian circuits, digraphs, and Turan’s Theorem has been 
relocated to facilitate more efficient learning. 

e Chapters 6 and 7 have been switched to introduce the idea of planarity earlier, 
and the section on complexity has become an appendix. 

e The glossary has been improved to eliminate errors and to emphasize items 
more directly related to the text. 


Features 
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Various features of this book facilitate students’ efforts to understand the 
material. There is discussion of proof techniques, more than 1200 exercises of 
varying difficulty, more than 400 illustrations, and many examples. Proofs are 
presented in full in the text. 

Many undergraduates begin a course in graph theory with little exposure 
to proof techniques. Appendix A provides background reading that will help 
them get started. Students who have difficulty understanding or writing proofs 
in the early material should be encouraged to read this appendix in conjunction 
with Chapter 1. Some discussion of proof techniques still appears in the early 
sections of the text (especially concerning induction), but an expanded treat- 
ment of the basic background (especially concerning sets, functions, relations, 
and elementary counting) is now in Appendix A. 

Most of the exercises require proofs. Many undergraduates have had lit- 
tle practice at presenting explanations, and this hinders their appreciation of 
graph theory and other mathematics. The intellectual discipline of justifying an 
argument is valuable. independently of mathematics; I hope that students will 
appreciate this. In writing solutions to exercises, students should be careful in 
their use of language (“say what you mean”), and they should be intellectually 
honest (“mean what you say”). 

Although many terms in graph theory suggest their definitions, the quan- 
tity of terminology remains an obstacle to fluency. Mathematicians like to 
gather definitions at the start, but most students succeed better if they use a 
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concept before receiving the next. This, plus experience and requests from re- 
viewers, has led me to postpone many definitions until they are needed. For 
example, the definition of cartesian product appears in Section 5.1 with color- 
ing problems. Line graphs are defined in Section 4.2 with Menger’s Theorem 
and in Section 7.1 with edge-coloring. The definitions of induced subgraph and 
join have now been postponed to Section 1.2 and Section 3.1, respectively. 

I have changed the treatment of digraphs substantially by postponing their 
introduction to Section 1.4. Introducing digraphs at the same time as graphs 
tends to confuse or overwhelm students. Waiting to the end of Chapter 1 al- 
lows them to become comfortable with basic concepts in the context of a single 
model. The discussion of digraphs then reinforces some of those concepts while 
clarifying the distinctions. The two models are still discussed together in the 
material on connectivity. 

This book contains more material than most introductory texts in graph 
theory. Collecting the advanced material as a final optional chapter of “addi- 
tional topics” permits usage at different levels. The undergraduate introduction 
consists of the first seven chapters (omitting most optional material), leaving 
Chapter 8 as topical reading for interested students. A graduate course can 
treat most of Chapters 1 and 2 as recommended reading, moving rapidly to 
Chapter 3 in class and reaching some.topics in Chapter 8. Chapter 8 can also 
be used as the basis for a second course in graph theory, along with material 
that was optional in earlier chapters. = 

Many results in graph theory have several proofs; illustrating this can in- 
crease students’ flexibility in trying multiple approaches to a problem. I include 
some alternative proofs as remarks and others as exercises. 

Many exercises have hints, some-given with the exercise statement and 
others in Appendix C, Exercises marked “(—)” or “(+)” are easier or more dif- 
ficult, respectively, than unmarked problems. Those marked “(+)” should not 
be assigned as homework in a typical undergraduate course. Exercises marked 
“(!)” are especially valuable, instructive, or entertaining. Those marked “(*)” 
use material labeled optional in the text. 

Each exercise section begins with a set of “(—)” exercises, ordered according 
to the material in the section and ending with a line of bullets. These exercises 
either check understanding of concepts or are immediate applications of results 
in the section. I recommend some of these to my class as “warmup” exercises 
to check their understanding before working the main homework problems, 
most of which are marked “(!)”. Most problems marked “(—)” are good exam 
questions. When using other exercises on exams, it may be a good idea to 
provide hints from Appendix C. 

Exercises that relate several concepts appear when the last is introduced. 
Many pointers to exercises appear in the text where relevant concepts are dis- 
cussed. An exercise in the current section is cited by giving only its item 
number among the exercises of that section. Other cross-references are by 
Chapter.Section.Item. 
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Organization and Modifications 


In the first edition, I sought a development that was intellectually coherent 
and displayed a gradual (not monotonic) increase in difficulty of proofs and in 
algorithmic complexity. 

Carrying this further in the second edition, Eulerian circuits and Hamilto- 
nian cycles are now even farther apart. The simple characterization of Eulerian 
circuits is now in Section 1.2 with material closely related to it. The remain- 
der of the former Section 2.4 has been dispersed to relevant locations in other 
sections, with Fleury’s Algorithm dropped. 

Chapter 1 has been substantially rewritten. I continue to avoid the term 
“multigraph”; it causes more trouble than it resolves, because many students 
assume that a multigraph must have multiple edges. It is less distracting to 
append the word “simple” where needed and keep “graph” as the general ob- 
ject, with occasional statements that in particular contexts it makes sense to 
consider only simple graphs. 

The treatment of definitions in Chapter 1 has been made more friendly 
and precise, particularly those involving paths, trails, and walks. The informal 
groupings of basic definitions in Section 1.1 have been replaced by Definition 
items that help students find definitions more easily. 

In addition to the material on isomorphism, Section 1.1 now has a more 
precise treatment of the Petersen graph and an explicit introduction of the 
notions of decomposition and girth. This provides language that facilitates 
later discussion in various places, and it Ge interesting explicit questions 
other than isomorphism. 

Sections 1.2—1.4 have become nee coherent. The treatment of Eulerian 
circuits motivates and completes Section 1.2. Some material has been removed 
from Section 1.3 to narrow its focus to degrees and counting, and this section 
has acquired the material on vertex degrees that had been in Section 1.4. Sec- 
tion 1.4 now provides the introduction to digraphs and can be treated lightly. 

Trees and distance appear together in Chapter 2 due to the many relations 
between these topics. Many exercises combine these notions, and algorithms 
to compute distances produce or use trees. 

Most graph theorists agree that the Kénig-Egervary Theorem deserves an 
independent proof without network flow. Also, students have trouble distin- 
guishing “k-connected” from “connectivity k”, which have the same relationship 
as “k-colorable” and “chromatic number k”. I therefore treat matching first and 
later use matching to prove Menger’s Theorem. Both matching and connectivity 
are used in the coloring material. 

In response to requests from a number of users, I have added a short op- 
tional subsection on dominating sets at the end of Section 3.1. The material 
on weighted bipartite matching has been clarified by emphasis on vertex cover 
instead of augmenting path and by better use of examples. 

Turan’s Theorem uses only elementary ideas about vertex degrees and in- 
duction and hence appeared in Chapter 1 in the first edition. This caused some 
difficulties, because it was the most abstract item up to that point and students 
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felt somewhat overwhelmed by it. Thus I have kept the simple triangle-free 
case (Mantel’s Theorem) in Section 1.3 and have moved the full theorem to 
Section 5.2 under the viewpoint of extremal problems related to coloring. 

The chapter on planarity now comes before that on “Edges and Cycles”. 
When an instructor is short of time, planarity is more important to reach than 
the material on edge-coloring and Hamiltonian cycles. The questions involved 
in planarity appeal intuitively to students due to their visual aspects, and many 
students have encountered these questions before. Also, the ideas involved in 
discussing planar graphs seem more intellectually broadening in relation to the 
earlier material of the course than the ideas used to prove the basic results on 
edge-coloring and Hamiltonian cycles. 

Finally, discussing planarity first makes the material of Chapter 7 more 
coherent. The new arrangement permits a more thorough discussion of the 
relationships among planarity, edge-coloring, and Hamiltonian cycles, lead- 
ing naturally beyond the Four Color Theorem to the optional new material on 
nowhere-zero flows as a dual concept to coloring. 

When students discover that the coloring and Hamiltonian cycle scopes 
lack good algorithms, many become curious about NP-completeness. Appendix 
B satisfies this curiosity. Presentation of NP-completeness via formal languages 
can be technically abstract, so some students appreciate a presentation in the 
context of graph problems. NP-completeness proofs also illustrate the variety 
and usefulness of “graph transformation” arguments. 

The text explores relationships among fundamental results. Petersen’s 
Theorem on 2-factors (Chapter 3) uses Eulerian circuits and bipartite match- 
ing. The equivalence between Menger’s Theorem and the Max Flow-Min Cut 
Theorem is explored more fully than in the first edition, and the “Baseball Elim- 
ination” application is now treated in more depth. The k — 1-connectedness of 
k-color-critical graphs (Chapter 5) uses bipartite matching. Section 5.3 offers 
a brief introduction to perfect graphs, emphasizing chordal graphs. Additional 
features of this text in comparison to some others include the algorithmic proof 
of Vizing’s Theorem and the proof of Kuratowski’s Theorem by Thomassen’s 
methods. 

There are various other additions and improvements in the first seven 
chapters. There is now a brief discussion of Heawood’s Formula and the 
Robertson—Seymour Theorem at the end of Chapter 6. In Section 7.1, a proof 
of Shannon’s bound on the edge-chromatic number has been added. In Section 
5.3, the characterization of chordal graphs is somewhat simpler than before by 
proving a stronger result about simplicial vertices. In Section 6.3, the proof of 
the reducibility of the Birkhoff diamond has been eliminated, but a brief dis- 
cussion of discharging has been added. The material discussing issues in the 
proof of the theorem is optional, and the aim is to give the flavor of the approach 
without getting into detailed arguments. From this viewpoint the reducibility 
proof seemed out of focus. 

Chapter 8 contains highlights of advanced material and is not intended for 
an undergraduate course. It assumes more sophistication than earlier chapters 
and is written more tersely. Its sections are independent; each selects appeal- 
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ing results from a large topic that merits a chapter of its own. Some of these 
sections become more difficult near the end; an instructor may prefer to sample 
early material in several sections rather than present one completely. 

There may be occasional relationships between items in Chapter 8 and 
items marked optional in the first seven chapters, but generally cross-references 
indicate the connections. The material of Chapter 8 has not changed substan- 
tially since the first edition, although many corrections have been made and 
the presentation has been clarified in many places. 

I will treat advanced graph theory more thoroughly in The Art of Combina- 
torics. Volume I is devoted to extremal graph theory and Volume II to structure 
of graphs. Volume III has chapters on matroids and integer programming (in- 
cluding network flow). Volume IV emphasizes methods in combinatorics and 
discusses various aspects of graphs, especially random graphs. 


Design of Courses 


I intend the 22 sections in Chapters 1-7 for a pace of slightly under two 
lectures per section when most optional material (starred items and optional 
subsections) is skipped. When I teach the course I spend eight lectures on 
Chapter 1, six lectures each on Chapters 4 and 5, and five lectures on each of 
Chapters 2, 3, 6, and 7. This presents the fundamental material in about 40 
lectures. Some instructors may want tospend more time on Chapter 1 and omit 
more material from later chapters. 

In chapters after the first, the most fundamental material is concentrated 
in the first section. Emphasizing these sections (while skipping the optional 
items) still illustrates the scope of graph theory in a slower-paced one-semester 
course. From the second sections of Chapters 2, 4, 5, 6, and 7, it would be bene- 
ficial to include Cayley’s Formula, Menger’s Theorem, Mycielski’s construction, 
Kuratowski’s Theorem, and Dirac’s Theorem (spanning cycles), respectively. 

Some optional material is particularly appealing to present in class. For 
example, I always present the optional subsections on Disjoint Spanning Trees 
(in Section 2.1) and Stable Matchings (in Section 3.2), and I usually present 
the optional subsection on f-factors (in Section 3.3). Subsections are marked 
optional when no later material in the first seven chapters requires them and 
they are not part of the central development of elementary graph theory, but 
these are nice applications that engage students’ interest. In one sense, the 
“optional” marking indicates to students that the final examination is unlikely 
to have questions on these topics. 

Graduate courses skimming the first two chapters might include from them 
such topics as graphic sequences, kernels of digraphs, Cayley’s Formula, the 
Matrix Tree Theorem, and Kruskal’s algorithm. 

Courses that introduce graph theory in one term under the quarter system 
must aim for highlights; I suggest the following rough syllabus: 1.1: adjacency 
matrix, isomorphism, Petersen graph. 1.2: all. 1.3: degree-sum formula, large 
bipartite subgraphs. 1.4: up to strong components, plus tournaments. 2.1: up 
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to centers of trees. 2.2: up to statement of Matrix Tree Theorem. 2.3: Kruskal’s 
algorithm. 3.1: almost all. 3.2: none. 3.3: statement of Tutte’s Theorem, proof 
of Petersen’s results. 4.1: up to definition of blocks, omitting Harary graphs. 
4.2: wp to open ear decomposition, plus statement of Menger’s Theorem(s). 
4.3: duality between flows and cuts, statement of Max-flow = Min-cut. 5.1: 
up to Szekeres-Wilf theorem. 5.2: Mycielski’s construction, possibly Turan’s 
Theorem. 5.3: up to chromatic recurrence, plus perfection of chordal graphs. 
6.1: non-planarity of K; and K33, examples of dual graphs, Euler’s formula 
with applications. 6.2: statement and examples of Kuratowski’s Theorem and 
Tutte’s Theorem. 6.3: 5-color Theorem, plus the idea of crossing number. 7.1: 
up to Vizing’s Theorem. 7.2: up to Ore’s condition, plus the Chvatal-Erdés 
condition. 7.3: Tait’s Theorem, Grinberg’s Theorem. 


Further Pedagogical Aspects 


In the revision I have emphasized some themes that arise naturally from 
the material; underscoring these in lecture helps provide continuity. 

More emphasis has been given to: the theme of TONCAS—“The obvious 
necessary condition is also sufficient.”> Explicit mention has been added that 
many of the fundamental results can be viewed in this way. This both provides 
a theme for the course and clarifies the distinction between the easy direction 
and the hard direction in an equivalence. 

Another theme that underlies much of Chapters 3—5 and Section 7.1 is that 
of dual maximization and minimization problems. In a graph theory course one 
does not want to delve deeply into the nature of duality in linear optimization. 
It suffices to say that two optimization problems form a dual pair when every 
feasible solution to the maximization problem has value at most the value of 
every feasible solution to the minimization problem. When feasible solutions 
with the same value are given for the two problems, this duality implies that 
both solutions are optimal. A discussion of the linear programming context 
appears in Section 8.1. 

Other themes can be identified among the proof techniques. One is the 
use of extremality to give short proofs and avoid the use of induction. Another 
is the paradigm for proving conditional statements by induction, as described 
explicitly in Remark 1.3.25. 

The development leading to Kuratowski’s Theorem is somewhat long. Nev- 
ertheless, it is preferable to present the proof in a single lecture. The prelim- 
inary lemmas reducing the problem to the 3-connected case can be treated 
lightly to save time. Note that the induction paradigm leads naturally to the 
two lemmas proved for the 3-connected case. Note also that the proof uses the 
notion of S-lobe defined in Section 5.2. 

The first lecture in Chapter 6 should not belabor technical definitions of 
drawings and regions. These are better left as intuitive notions unless students 
ask for details; the precise statements appear in the text. 
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The motivating applications of digraphs in Section 1.4 have been marked 
optional because they are not needed in the rest of the text, but they help clarify 
that the appropriate model (graph or digraph) depends on the application. 

Due to its reduced emphasis on numerical computation and increased em- 
phasize on techniques of proof and clarity of explanations, graph theory is an 
excellent subject in which to encourage students to improve their habits of 
communication, both written and oral. In addition to assigning written home- 
work that requires carefully presented arguments, I have found it productive 
to organize optional “collaborative study” sessions in which students can work 
together on problems while I circulate, listen, and answer questions. It should 
not be forgotten that one of the best ways to discover whether one understands 
a proof is to try to explain it to someone else. The students who participate find 
these sessions very beneficial. 


Acknowledgments 


This text benefited from classroom testing of gradually improving pre- 
publication versions at many universities. Instructors who used the text on 
this experimental basis were, roughly in chronological order, Ed Scheinerman 
(Johns Hopkins), Kathryn Fraughnaugh (Colorado-Denver), Paul Weichsel / 
Paul Schupp / Xiaoyun Lu (Illinois), Dean Hoffman / Pete Johnson / Chris 
Rodger (Auburn), Dan Ullman (George-Washington), Zevi Miller / Dan Pritikin 
(Miami-Ohio), David Matula (Southern Methodist), Pavol Hell (Simon Fraser), 
Grzegorz Kubicki (Louisville), Jeff Smith (Purdue), Ann Trenk (Wellesley), Ken 
Bogart (Dartmouth), Kirk Tolman (Brigham Young), Roger Eggleton (Illinois 
State), Herb Kasube (Bradley), and Jeff Dinitz (Vermont). Many of these (or 
their students) provided suggestions or complaints that led to improvements. 

I thank George Lobell at Prentice Hall for his continuing commitment to 
this project and for finding diligent reviewers. Reviewers Paul Edelman, Renu 
Laskar, Gary MacGillivray, Joseph Neggers, Joseph Malkevitch, James Oxley, 
Sam Stueckle, and Barry Tesman made helpful comments. Reviewers for early 
versions of sections of Chapter 8 included Mike Albertson, Sanjoy Barvah, Dan 
Kleitman, James Oxley, Chris Rodger, and Alan Tucker. Reviewers for the 
second edition included Nate Dean, Dalibor Froncek, Renu Laskar, Michael 
Molloy, David Sumner, and Daniel Ullman. 

Additional comments and corrections between the first and second editions 
came from many readers. These items range from typographical errors to ad- 
ditional exercises to simplifications in proofs. The accumulated effect of these 
contributions is substantial. Most of these suggestions don’t lend themselves to 
attribution within the text, so I express my gratitude here for their willingness 
to contribute their observations, opinions, and expertise. These people include 
Troy Barcume, Stephan Brandt, Gerard Chang, Scott. Clark, Dave Gunderson, 
Dean Hoffman, John D’Angelo, Charles Delzell, Thomas Emden-Weinert, Shi- 
mon:Even, Fred Galvin, Alfio Giarlotta, Don Greenwell, Jing Huang, Garth 


Preface xix 


Isaak, Steve Kilner, Alexandr Kostochka, André Kiindgen, Peter Kwok, Jean- 
Marc Lanlignel, Francois Margot, Alan Mehlenbacher, Joel Miller, Zevi Miller, 
Wendy Myrvold, Charles Parry, Robert Pratt, Dan Pritikin, Radhika Rama- 
murthi, Craig Rasmussen, Bruce Reznick, Jian Shen, Tom Shermer, Warren 
Shreve, Alexander Strehl, Tibor Szabé, Vitaly Voloshin, and C.Q. Zhang. 

Several students found numerous typographical errors in the pre-publication 
version of the second edition (thereby earning extra credit!): Jaspreet Bagga, 
Brandon Bowersox, Mark Chabura, John Chuang, Greg Harfst, Shalene Melo, 
Charlie Pikscher, and Josh Reed. 

The cover drawing for the first edition was produced by Ed Scheinerman 
using BRL-CAD, a product of the U.S. Army Ballistic Research Laboratory. For 
the second edition, the drawing was produced by Maria Muyot using CorelDraw. 

Chris Hartman contributed vital assistance in preparing the bibliography 
for the first edition; additional references have now been added. Ted Harding 
helped resolve typesetting difficulties in other parts of the first edition. 

Students who helped gather data for the index of the first edition included 
Maria Axenovich, Nicole Henley, André Kiindgen, Peter Kwok, Kevin Leuthold, 
John Jozwiak, Radhika Ramamurthi, and Karl Schmidt. Raw data for the 
index of the second edition was gathered using scripts I wrote in perl; Maria 
Muyot and Radhika Ramamurthi assisted with processing of the index and the 
bibliography. 

I prepared the second edition in Tex, the typesetting system for which 
the scientific world owes Donald E. Knuth eternal gratitude. The figures were 
generated using gpic, a product of the;Free Software Foundation. 


| book 


telegran 


Feedback 


I welcome corrections and suggestions, including comments on topics, at- 
tributions of results, updates, suggestions for exercises, typographical errors, 
omissions from the glossary or index, etc. Please send these to me at 


west@math.uiuc.edu 


In particular, I apologize in advance for missing references; please inform 
me of the proper citations! Also, no changes other than corrections of errors 
will be made between printings of this edition. 

I maintain a web site containing a syllabus, errata, updates, and other 
material. Please visit! 


http://www.math.uiuc.edu/~west/igt 


I have corrected all typographical and mathematical errors known to me be- 
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Chapter 1 


Fundamental Concepts 


1.1. What Is a Graph? 


How can we lay cable at minimum cost to make every telephone reachable 
from every other? What is the fastest route from the national capital to each 
state capital? How can n jobs be filled by n people with maximum total utility? 
What is the maximum flow per unit time from source to sink in a network of 
pipes? How many layers does a computer chip need so that wires in the same 
layer don’t cross? How can the season of a sports league be scheduled into the 
minimum number of weeks? In what order should a traveling salesman visit 
cities to minimize travel time? Can we color the regions of every map using 
four colors so that neighboring regions receive different colors? 

These and many other practical problems involve graph theory. In this 
book, we develop the theory of graphs and apply it to such problems. Our 
starting point assumes the mathematical background in Appendix A, where 
basic objects and language of mathematics are discussed. 


c 


telegram.m 


THE DEFINITION 


The problem that is often said to have been the birth of graph theory will 
suggest our basic definition of a graph. 


1.1.1. Example. The Kénigsberg Bridge Problem. The city of Kénigsberg was 
located on the Pregel river in Prussia. The city occupied two islands plus ar- 
eas on both banks. These regions were linked by seven bridges as shown on 
the left below. The citizens wondered whether they could leave home, cross ev- 
ery bridge exactly once, and return home. The problem reduces to traversing 
the figure on the right, with heavy dots representing land masses and curves 
representing bridges. 
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The model on the right makes it easy to argue that the desired traversal 
does not exist. Each time we enter and leave a land mass, we use two bridges 
ending at it. We can also pair the first bridge with the last bridge on the land 
mass where we begin and end. Thus existence of the desired traversal requires 
that each land mass be involved in an even number of bridges. This necessary 
condition did not hold in Konigsberg. _ 


The Konigsberg Bridge Problem becomes more interesting when we show 
in Section 1.2 which configurations have traversals. Meanwhile, the problem 
suggests a general model for discussing such questions. 


ks 


1.1.2. Definition. A graph u is a triple consisting of a vertex set V(G), an 
edge set E(G), and a relation that associates with each edge two vertices 
(not necessarily distinct) called itscendpoints. 


ny 
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We draw a graph on paper by placing each vertex at a point and repre- 
senting each edge by a curve joining the locations of its endpoints. 


1.1.3. Example. In the graph in Example 1.1.1, the vertex set is {x, y, z, w}, 
the edge set is {e1, eg, €3, €4, €5, €g, €7}, and the assignment of endpoints to edges 
can be read from the picture. 

Note that edges e; and eg have the same endpoints, as do e3 and e4. Also, 
if we had a bridge over an inlet, then its ends would be in the same land mass, 
and we would draw it as a curve with both ends at the same point. We have 
appropriate terms for these types of edges in graphs. | 


1.1.4. Definition. A loop is an edge whose endpoints are equal. Multiple 
edges are edges having the same pair of endpoints. 

A simple graph is a graph having no loops or multiple edges. We 
specify a simple graph by its vertex set and edge set, treating the edge set 
as a set of unordered pairs of vertices and writing e = uv (or e = vu) for an 
edge e with endpoints u and v. 

When u and v are the endpoints of an edge, they are adjacent and are 
neighbors. We write u < v for “u is adjacent to v”. 


In many important applications, loops and multiple edges do not arise, and 
we restrict our attention to simple graphs. In this case an edge is determined by 


Section 1.1: What Is a Graph? 3 


its endpoints, so we can name the edge by its endpoints, as stated in Definition 
1.1.4. Thus in a simple graph we view an edge as an unordered pair of vertices 
and can ignore the formality of the relation associating endpoints to edges. This 
book emphasizes simple graphs. 


1.1.5. Example. On the left below are two drawings of a simple graph. The 
vertex set is {u, v, w, x, y}, and the edge set is {uv, uw, ux, vx, vw, xw, xy}. 

The terms “vertex” and “edge” arise from solid geometry. A cube has ver- 
tices and edges, and these form the vertex set and edge set of a graph. It is 
drawn on the right below, omitting the names of vertices and edges. | 


J 


A graph is finite if its vertex set and edge set are finite. We adopt the con- 
vention that every graph mentioned in this book is finite, unless explicitly 
constructed otherwise. 


math_| 


1.1.6.* Remark. The null graph is the graph whose vertex set and edge set 
are empty. Extending general theorems to the null graph introduces unnec- 
essary distractions, so we ignore it. All statements and exercises should be 
considered only for graphs with a nonempty set of vertices. | 


GRAPHS AS MODELS 


Graphs arise in many settings. The applications suggest useful concepts 
and terminology about the structure of graphs. 


1.1.7. Example. Acquaintance relations and subgraphs. Does every set of six 
people contain three mutual acquaintances or three mutual strangers? Since 
“acquaintance” is symmetric, we model it using a simple graph with a vertex 
for each person and an edge for each acquainted pair. The “nonacquaintance” 
relation on the same set yields another graph with the “complementary” set of 
edges. We introduce terms for these concepts. | 
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1.1.8. Definition. The complement G of a simple graph G is the simple graph 
with vertex set V(G) defined by uv € E(G) if and only if uv ¢ E(G). A 
clique in a graph is a set of pairwise adjacent vertices. An independent 
set (or stable set) in a graph is a set of pairwise nonadjacent vertices. 


In the graph G of Example 1.1.7, {u, x, y} is a clique of size 3 and {u, w} is 
an independent set of size 2, and these are the largest such sets. These values 
reverse in the complement G, since cliques become independent sets (and vice 
versa) under complementation. The question in Example 1.1.7 asks whether it 
is true that every 6-vertex graph has a clique of size 3 or an independent set of 
size 3 (Exercise 29). Deleting edge ux from G yields a 5-vertex graph having no 
clique or independent set of size 3. 


1.1.9. Example. Job assignments and bipartite graphs. We have m jobs and 
n people,-but not all people are qualified for all jobs. Can we fill the jobs with 
qualified people? We model this using a simple graph H with vertices for the 
jobs and people; job j is adjacent to person p if p can do j. 

Each job is to be filled by exactly one person, and each person can hold at 
most one of the jobs. Thus we seek m pairwise disjoint edges in H (viewing 
edges as pairs of vertices). Chapter 3 shows how to test for this; it can’t be done 
in the graph below. 7 

The use of graphs to model relations between two disjoint sets has many 
important applications. These are the-graphs whose vertex sets can be parti- 
tioned into two independent sets; we need a name for them. | 


d jobs 


1.1.10. Definition. A graph G is bipartite if V(G) is the union of two disjoint 
(possibly empty) independent sets called partite sets of G. 


) 
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1.1.11. Example. Scheduling and graph coloring.. Suppose we must schedule 
Senate committee meetings into designated weekiy time periods. We cannot 
assign two committees to the same time if they have a common member. How 
many different time periods do we need? 

We create a vertex for each committee, with two vertices adjacent when 
their committees have a common member. We must assign labels (time periods) 
to the vertices so the endpoints of each edge receive different labels. In the 
graph below, we can use one label for each of the three independent sets of 
vertices grouped closely together. The members of a clique must receive distinct 
labels, so in this example the minimum number of time periods is three. 

Since we are only interested in partitioning the vertices, and the labels 
have no numerical value, it is convenient to call them colors. | 
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1.1.12. Definition. The chromatic number of a graph G, written x(G), is 
the minimum number of colors needed to label the vertices so that adjacent 
vertices receive different colors. A graph G is k-partite if V(G) can be 
expressed as the union of k (possibly empty) independent sets. 


This generalizes the idea of bipartite graphs, which are 2-partite. Vertices 
given the same color must form an independent set, so x(G) is the minimum 
number of independent sets needed to partition V(G). A graph is k-partite if 
and only if its chromatic number is at most k. We use the term “partite set” 
when referring to a set in a partition into independent sets. 

We study chromatic number and graph colorings in Chapter 5. The most 
(in)famous problem in graph theory involves coloring of “maps”. 

1.1.13. Example. Maps and coloring, Roughly speaking, a map is a partition 
of the plane into connected regions. Can we color the regions of every map 
using at most four colors so that neighboring regions have different colors? 

To relate map coloring to graph coloring, we introduce a vertex for each 
region and an edge for regions sharing a boundary. The map question asks 
whether the resulting graph must have chromatic number at most 4. The graph 
can be drawn in the plane without crossing edges; such graphs are planar. 
The graph before Definition 1.1.12 is planar; that drawing has a crossing, but 
another drawing has no crossings. We study planar graphs in Chapter 6. a 


1.1.14. Example. Routes in road networks. We can model a road network using 
a graph with edges corresponding to road segments between intersections. We 
can assign edge weights to measure distance or travel time. In this context 
edges do represent physical links. How can we find the shortest route from x 
to y? We show how to compute this in Chapter 2. 

If the vertices of the graph represent our house and other places to visit, 
then we may want to follow a route that visits every vertex exactly once, so as 
to visit everyone without overstaying our welcome. We consider the existence 
of such a route in Chapter 7. 

We need terms to describe these two types of routes in graphs. a 


1.1.15. Definition. A path is a simple graph whose vertices can be ordered 
so that two vertices are adjacent if and only if they are consecutive in the 
‘list. A cycle is a graph with an equal number of vertices and edges whose 
vertices can be placed around a circle so that two vertices are adjacent if 
and only if they appear consecutively along the circle. 
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Xx g Xx z 
x 0 a » ea 
y b y b 


Above we show a path and a cycle, as demonstrated by listing the vertices 
in the order x, b,a,z, y. Dropping one edge from a cycle produces a path. In 
studying the routes in road networks, we think of paths and cycles contained 
in the graph. Also, we hope that every vertex in the network can be reached 
from every other. The next definition makes these concepts precise. 


1.1.16. Definition. A subgraph of a graph G is a graph H such that V(H) C 
V(G) and E(H) © E(G) and the assignment of endpoints to edges in H is 
the same as 1n G. We then write H C G and say that “G contains H”. 
A graph G is connected if each pair of vertices in G belongs to a path; 
otherwise, G is disconnected. 


The graph before Definition 1.1.12 has three subgraphs that are cycles. It 
is a connected graph, but the graph in Example 1.1.9 is not. 


MATRICES AND ISOMORPHISM 


a 


boc 


me 


How do we specify a graph? We can list the vertices and edges (with end- 
points), but there are other useful representations. Saying that a graph is 
loopless means that multiple edges are allowed but loops are not. 


1.1.17. Definition. Let G be a loopless graph with vertex set V(G) = {v1,..., vn} 
and edge set E(G) = {e1,...,em}. The adjacency matrix of G, written 
A(G), is the n-by-n matrix in which entry a; ; is the number of edges in G 
with endpoints {v;, v;}. The incidence matrix M(G) is the n-by-m matrix 
in which entry m, ; is 1 if v; is an endpoint of e; and otherwise is 0. 

If vertex v is an endpoint of edge e, then v and e are incident. The 
degree of vertex v (in a loopless graph) is the number of incident edges. 


The appropriate way to define adjacency matrix, incidence matrix, or ver- 
tex degrees for graphs with loops depends on the application; Sections 1.2 and 
1.3 discuss this. 


1.1.18. Remark. An adjacency matrix is determined by a vertex ordering. Ev- 
ery adjacency matrix is symmetric (qa; ; = a;, for all i, 7). An adjacency matrix 
of a simple graph G has entries 0 or 1, with Os on the diagonal. The degree of 
v is the sum of the entries in the row for v in either A(G) or M(G). a 


1.1.19. Example. For the loopless graph G below, we exhibit the adjacency 
matrix and incidence matrix that result from the vertex ordering w, x, y, z and 
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the edge ordering a, b,c,d,e. The degree of y is 4, by viewing the graph or ied 
summing the row for y in either matrix. 


wx y Zz abecede 

wf/O0 11 0 wf1l1 00 0 

x11 0 2 0 x}101 1 0 

y{12 01 y?/O1111 

z\0O 01 0 z\0 000 1 
A(G) M(G) 


Presenting an adjacency matrix for a graph implicitly names the vertices 
by the order of the rows; the ith vertex corresponds to the ith row and column. 
Storing a graph in a computer requires naming the vertices. 

Nevertheless, we want to study properties (like connectedness) that do not 
depend on these names. Intuitively, the structural properties of G and H will 
be the same if we can rename the vertices of G using the vertices in H so that G 
will actually become H. We make the definition precise for simple graphs. The 
renaming is a function from V(G) to V(A) that assigns each element of V(H) 
to one element of V(G), thus pairing them up. Such a function is a one-to-one 
correspondence or bijection (see Appendix A). Saying that the renaming turns 
G into H is saying that the vertex bijection preserves the adjacency relation. 


1.1.20. Definition. An isomorphism from a simple graph G to a simple 
graph H is a bijection f: V(G) > V(H) such that uv € E(G) if and only if 
fu) f(r) € E(A). We say “G is isomorphic to H”, written G = H, if there 
is an isomorphism from G to H. 


1.1.21. Example. The graphs G and H drawn below are 4-vertex paths. Define 
_ the function f: V(G) > V(A) by f(w) =a, f(x) =d, fly) = b, f(z) =c. To 
show that f is an isomorphism, we check that f preserves edges and non- 
edges. Note that rewriting A(G) by placing the rows in the order w, y, z, x and 
the columns also in that order yields A(H), as illustrated below; this verifies 
that f is an isomorphism. 


Another isomorphism maps w, x, y, z to c, b, d, a, respectively. | 
w y c d 
eYl "xX 
x Zz a b 


N te & S 
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OOrR-F x 
Ke OOo 8 
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1.1.22. Remark. Finding isomorphisms. As suggested in Example 1.1.21, pre- 
senting the adjacency matrices with vertices ordered so that the matrices are 
identical is one way to prove that two graphs are isomorphic. Applying a per- 
mutation o to both the rows and the columns of A(G) has the effect of reordering 
the vertices of G. If the new matrix equals A(H), then the permutation yields 
an isomorphism. One can also verify preservation of the adjacency relation 
without writing out the matrices. 

In order for an explicit vertex bijection to be an isomorphism from G to H, 
the image in H of a vertex v in G must behave in H as v does in G. For example, 
they must have the same degree. a 


1.1.23.* Remark. [somorphism for non-simple graphs. The definition of iso- 
morphism extends to graphs with loops and multiple edges, but the precise 
statement needs the language of Definition 1.1.2. 


An isomorphism from G to H is a bijection f that maps V(G) to 
V(H) and E(G) to E(A) such each edge of G with endpoints u and v is 
mapped to an edge with endpoints f(u) and f(v). 
This technicality will not concern us, because we will study isomorphism only 
in the context of simple graphs. é | 


‘books 


Since H is isomorphic to G whenever G is isomorphic to H, we often say 
“G and H are isomorphic” (meaning toeach other). The adjective “isomorphic” 
applies only to pairs of graphs; “G is isomorphic” by itself has no meaning (we 
respond, “isomorphic to what?”). Similarly, we may Say that a set of graphs is 
“pairwise isomorphic” (taken two at a time), but it doesn’t make sense to say 
“this set of graphs is isomorphic”. 

A relation on a set S is a collection of ordered pairs from S. An equiv- 
alence relation is a relation that is reflexive, symmetric, and transitive (see 
Appendix A). For example, the adjacency relation on the set of vertices of a 
graph is symmetric, but it is not reflexive and rarely is transitive. On the other 
hand, the isomorphism relation, consisting of the set of ordered pairs (G, H) 
such that G is isomorphic to H, does have all three properties. 


1.1.24. Proposition. The isomorphism relation is an equivalence relation on 
the set of (simple) graphs. 


Proof: Reflexive property. The identity permutation on V(G) is an isomorphism 
from G to itself. Thus G = G. 

Symmetric property. If f: V(G) > V(A) is an isomorphism from G to H, 
then f~/ is an isomorphism from H to G, because the statement “uv € E(G) if 
and only if f(u)f(v) € E(H)” yields “xy € E(H#) if and only if f~1(x) f-1(y) € 
E(H)”. Thus G = H implies H =G. 

Transitive property. Suppose that f: V(F) — V(G) and g: V(G) > V(A) 
are isomorphisms. We are given “uv € E(F) if and only if f(u) f(v) € E(G)” 
and “xy € E(G) if and only if g(x)g(y) € E(H)”. Since f is an isomorphism, for 
every xy € E(G) we can find uv € E(F) such that f(u) = x and f(v) = y. This 
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yields “uv € E(F) if and only if g(f(u))g(f(v)) € E(AH)”. Thus the composition 
go f is an isomorphism from F to H. We have proved that F = G andG =H 
together imply F = H. u 


An equivalence relation partitions a set into equivalence classes; two 
elements satisfy the relation if and only if they lie in the same class. 


1.1.25. Definition. An isomorphism class of graphs is an equivalence class 
of graphs under the isomorphism relation. 


Paths with n vertices are pairwise isomorphic; the set of all n-vertex paths 
forms an isomorphism class. 


1.1.26. Remark. “Unlabeled” graphs and isomorphism classes. When dis- 
cussing a graph G, we have a fixed vertex set, but our structural comments 
apply also to every graph isomorphic to G. Our conclusions are independent 
of the names (labels) of the vertices. Thus, we use the informal expression 
“unlabeled graph” to mean an isomorphism class of graphs. 

When we draw a graph, its vertices are named by their physical locations, 
even if we give them no other names. Hence a drawing of a graph is a member - 
of its isomorphism class, and we just call it a graph. When we redraw a graph 
to display some structural aspect, we have chosen a more convenient member 
of the isomorphism class, still discussing the same “unlabeled graph”. a 

When discussing structure of graphs, it is convenient to have names and 
notation for important isomorphism classes. We want the flexibility to refer to 
the isomorphism class or to any representative of it. 


1.1.27. Definition. The (unlabeled) path and cycle with n vertices are denoted 
P, and C,, respectively; an n-cycle is a cycle with n vertices. A com- 
plete graph is a simple graph whose vertices are pairwise adjacent; the 
(unlabeled) complete graph with n vertices is denoted K,. A complete 
bipartite graph or biclique is a simple bipartite graph such that two 
vertices are adjacent if and only if they are in different partite sets. When 
the sets have sizes r and s, the (unlabeled) biclique is denoted K,.,. 


Ks Ko3 


1.1.28.* Remark. We have defined a complete graph as a graph whose vertices 
are pairwise adjacent, while a clique is a set of pairwise adjacent vertices in a 
graph. Many authors use the terms interchangeably, but the distinction allows 
us to discuss cliques in the same language as independent sets. 
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In the bipartite setting, we simply use “biclique” to abbreviate “complete 
bipartite graph”. The alternative name “biclique” is a reminder that a complete 
bipartite graph is generally not a complete graph (Exercise 1). a 


1.1.29. Remark. A graph by any other name. . . When we name a graph 
without naming its vertices, we often mean its isomorphism class. Technically, 
“H is‘a subgraph of G” means that some subgraph of G is isomorphic to H (we 
say “G contains a copy of H”). Thus C3 is a subgraph of K; (every complete 
graph with 5 vertices has 10 subgraphs isomorphic to C3) but not of Ko3. 
Similarly, asking whether G “is” C,, means asking whether G is isomorphic 
to a cycle with n vertices. i 


The structural properties of a graph are determined by its adjacency re- 
lation and hence are preserved by isomorphism. We can prove that G and 
H are not isomorphic by finding some structural property in which they dif- 
fer. If they have different number of edges, or different subgraphs, or different 
complements, etc., then they cannot be isomorphic. 

On the other hand, checking that a few structural properties are the same 
does not imply that G = H. To prove that G = H, we must present a bijection 
f: V(G) > V(A) that preserves the adjacency relation. 


1.1.30. Example. Isomorphic or not?=Each graph below has six vertices and 
nine edges and is connected, but these graphs are not pairwise isomorphic. 

To prove that G; = Go, we give names to the vertices, specify a bijection, 
and check that it preserves the adjacency relation. As labeled below, the bijec- 
tion that sends u, v, w, x, y,z to 1, 3, 5,2, 4, 6, respectively, is an isomorphism 
from G,; to Gg. The map sending u, v, w, x, y, z to 6, 4, 2, 1, 3, 5, respectively, is 
another isomorphism. 

Both G,; and Gz are bipartite; they are drawings of K33 (as is G4). The 
graph G3 contains K3, so its vertices cannot be partitioned into two independent 
sets. Thus G3 is not isomorphic to the others. 

Sometimes we can test isomorphism quickly using the complements. Sim- 
ple graphs G and H are isomorphic if and only if their complements are iso- 
morphic (Exercise 4). Here G1, G2, G4 all consist of two disjoint 3-cycles and 
are not connected, but G3 is a 6-cycle and is connected. | 


uov w 6 1 - 
x y Z 4 3 
G1 Go G3 G4 


1.1.31. Example. The number of n-vertex graphs. When choosing two vertices 
from a set of size n, we can pick one and then the other but don’t care about the 
order, so the number of ways is n(n —1)/2. (The notation for the number of ways 
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to choose k elements from n elements is (i) read “n choose k”. These numbers 
are called binomial coefficients; see Appendix A for further background.) 

In a simple graph with a vertex set X of size n, each vertex pair may form 
an edge or may not. Making the choice for each pair specifies the graph, so the 
number of simple graphs with vertex set X is 2). 

For example, there are 64 simple graphs on a fixed set of four vertices. 
These graphs form only 11 isomorphism classes. The classes appear below in 
complementary pairs; only P, is isomorphic to its complement. Isomorphism 
classes have different sizes, so we cannot count the isomorphism classes of n- 
vertex simple graphs by dividing 2) by the size of a class. | 


NS OM 
MMe ON | 


DECOMPOSITION AND SPECIAL GRAPHS 


The example P, = P4 suggests a family of graph problems. 


ne 


1.1.32. Definition. A graph is self-complementary if it is isomorphic to its 
complement. A decomposition of a graph is a list of subgraphs such that 
each edge appears in exactly one subgraph in the list. 


An n-vertex graph H is self-complementary if and only if K,, has a decom- 
position consisting of two copies of H. 


1.1.33. Example. We can decompose K; into two 5-cycles, and thus the 5-cycle 
is self-complementary. Any n-vertex graph and its complement decompose K,,. 
Also K,,,-1 and K,_;.decompose K,,, even though one of these subgraphs omits 
a vertex. On the right below we show a decomposition of K, using three copies 
of P3. Exercises 31—39 consider graph decompositions. | 


1.1.34.* Example. The question of which complete graphs decompose into 
copies of K3 is a fundamental question in the theory of combinatorial designs. 
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On the left below we suggest such a decomposition for K7. Rotating the triangle 
through seven positions uses each edge exactly once. 

On the right we suggest a decomposition of Kg into copies of Py. Placing 
one vertex in the center groups the edges into three types: the outer 5-cycle, the 
inner (crossing) 5-cycle on those vertices, and the edges involving the central 
vertex. Each 4-vertex path in the decomposition uses one edge of each type; we 
rotate the picture to get the next path. | 


We referred to a copy of K3 as atriangle. Short names for graphs that arise 
frequently in structural discussions can be convenient. 


1.1.35. Example. The Graph Menagerie. A catchy “name” for a graph usually 
comes from some drawing of the graph, We also use such a name for all other 
drawings, and hence it is best viewed as a name for the isomorphism class. 
Below we give names to several graphs with at most five vertices. 

Among these the most important are the triangle (K3) and the claw (K;,3). 
We also sometimes discuss the paw (K1.3 + e)and the kite (K4 — e); the others 
arise less frequently. 

The complements of the graphs in the first row are disconnected. The 
complement of the house is P;, and the bull is self-complementary. Exercise 39 


asks which of these graphs can be used to decompose Kg. | 
triangle claw kite 
house bull bowtie 


In order to decompose H into copies of G, the number of edges of G must 
divide the number of edges of H. This is not sufficient, since Ks does not 
decompose into two copies of the kite. 
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1.1.36. Definition. The Petersen graph is the simple graph whose vertices 
are the 2-element subsets of a 5-element set and whose edges are the pairs 
of disjoint 2-element subsets. 


12 


Q 
24 


23 51 


We have drawn the Petersen graph in three ways above. It is a useful 
example so often that an entire book was devoted to it (Holton—Sheehan [1993]). 
Its properties follow from the statement of its adjacency relation that we have 
used as the definition. 


1.1.37. Example. Structure of the Petersen graph. Using [5] = {1, 2, 3, 4, 5} 
as our 5-element set, we write the pair {a, b} as ab or ba. Since 12 and 34 are 
disjoint, they are adjacent vertices when we form the graph, but 12 and 23 are 
not. For each 2-set ab, there are three-ways to pick a 2-set from the remaining 
three elements of [5], so every vertex has degree 3. 

The Petersen graph consists of two disjoint 5-cycles plus edges that pair 
up vertices on the two 5-cycles. The disjointness defirition tells us that 
12, 34, 51, 23, 45 in order are the vertices of a 5-cycle, and similarly this holds 
for the remaining vertices 13, 52, 41, 35, 24. Also 13 is adjacent to 45, and 52 is 
adjacent to 34, and so on, as shown on the left above. 

We use this name even when we do not specify the vertex labeling; in 
essence, we use “Petersen graph” to name an isomorphism class. To show that 
the graphs above are pair-/ise isomorphic, it suffices to name the vertices of 
each using the 2-element subsets of [5] so that in each case the adjacency rela- 
tion is disjointness (Exercise 24). a 


1.1.38. Proposition. If two vertices are nonadjacent in the Petersen graph, 
then they have exactly one common neighbor. 


Proof: Nonadjacent vertices are 2-sets sharing one element; their union S has 
size 3. A vertex adjacent to both is a 2-set disjoint from both. Since the 2-sets 
are chosen from {1, 2, 3, 4, 5}, there is exactly one 2-set disjoint from S. a 


1.1.39. Definition. The girth of a graph with a cycle is the length of its short- 
est cycle. A graph with no cycle has infinite girth. 


1.1.40. Corollary. The Petersen graph has girth 5. 


Proof: The graph is simple, so it has no 1-cycle or 2-cycle. A 3-cycle would 
require three pairwise-disjoint 2-sets, which can’t occur among 5 elements. 


14 Chapter 1: Fundamental Concepts 


A 4-cycle in the absence of 3-cycles would require nonadjacent vertices with 
two common neighbors, which Proposition 1.1.38 forbids. Finally, the vertices 
12, 34, 51, 23, 45 yield a 5-cycle, so the girth is 5. | 


The Petersen graph is highly symmetric. Every permutation of {1, 2, 3, 4, 5} 
generates a permutation of the 2-subsets that preserves the disjointness rela- 
tion. Thus there are at least 5! = 120 isomorphisms from the Petersen graph 
to itself. Exercise 43 confirms that there are no others. 


1.1.41.* Definition. An automorphism of G is an isomorphism from G to G. 
A graph G is vertex-transitive if for every pair u,v € V(G) there is an 
automorphism that maps u to v. 


The automorphisms of G are the permutations of V(G) that can be applied 
to both the rows and the columns of A(G) without changing A(G). 


1.1.42.* Example. Automorphisms. Let G be the path with vertex set 
{1, 2,3, 4} and edge set {12, 23, 34}. This graph has two automorphisms: the 
identity permutation and the permutation that switches 1 with 4 and switches 
2 with 3. Interchanging vertices 1 and 2 is not an automorphism of G, although 
G is isomorphic to the graph with vertex set {1, 2, 3, 4} and edge set {21, 13, 34}. 

In K,;, permuting the vertices of one partite set does not change the ad- 
jacency matrix; this leads to r!s! automorphisms. When r = s, we can also 
interchange the partite sets; K,, has 2(t!)? automorphisms. 

The biclique K,., is vertex-transitive if and only ifr = s. Ifn > 2, then P, is 
not vertex-transitive, but every cycle is vertex-transitive. The Petersen graph 
is vertex-transitive. - a 


We can prove a statement for every vertex in a vertex-transitive graph by 
proving it for one vertex. Vertex-transitivity guarantees that the graph “looks 
the same” from each vertex. 


EXERCISES 


Solutions to problems generally require clear explanations written in sentences. 
The designations on problems have the following meanings: 

“(—)” = easier or shorter than most, 

“(+)” = harder or longer than most, 

“(!)” = particularly useful or instructive, 

“(*)” = involves concepts marked optional in the text. 
The exercise sections begin with easier problems to check understanding, ending with 
a line of dots. The remaining problems roughly follow the order of material in the text. 


1.1.1. (—) Determine which complete bipartite graphs are complete graphs. 


1.1.2. (—) Write down all possible adjacency matrices and incidence matrices for a 3- 
vertex path. Also write down an adjacency matrix for a path with six vertices and for a 
cycle with six vertices. 
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1.1.3. (—) Using rectangular blocks whose entries are all equal, write down an adjacency 
matrix for K,,.,- 


1.1.4. (—) From the definition of isomorphism, prove that G = H if and only if G = H. 


1.1.5. (—) Prove or disprove: If every vertex of a simple graph G has degree 2, then G 
is a cycle. 


1.1.6. (—) Determine whether the graph below decomposes into copies of P,. 


1.1.7. (—) Prove that a graph with more than six vertices of odd degree cannot be 
decomposed into three paths. 


1.1.8. (—) Prove that the 8-vertex graph on the left below decomposes into copies of K1.3 
and also into copies of P4. 


m.me/math_books 


1.1.9. (—) Prove that the graph on the rig 
the graph on the left. 


it above is isomorphic to the complement of 


ou 


teles 


1.1.10. (—) Prove or disprove: The complement of a simple disconnected graph must be 
connected. 


1.1.11. Determine the maximum size of a clique and the maximum size of an indepen- 
dent set in the graph below. 


1.1.12. Determine whether the Petersen graph is bipartite, and find the size of its 
largest independent set. 


1.1.13. Let G be the graph whose vertex set is the set of k-tuples with coordinates 
in {0,1}, with x adjacent to y when x and y differ in exactly one position. Determine 
whether G is bipartite. 


1.1.14. (!) Prove that removing opposite corner squares from an 8-by-8 checkerboard 
leaves a subboard that cannot be partitioned into 1-by-2 and 2-by-1 rectangles. Using 
the same argument, make a general statement about all bipartite graphs. 
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1.1.15. Consider the following four families of graphs: A = {paths}, B = {cycles}, C = 
{complete graphs}, D = {bipartite graphs}. For each pair of these families, determine 
all isomorphism classes of graphs that belong to both families. 


1.1.16. Determine whether the graphs below are isomorphic. 


en 


1.1.17. Determine the number of isomorphism classes of simple 7-vertex graphs in 
which every vertex has degree 4. 


1.1.18. Determine which pairs of graphs below are isomorphic. 


. 0 a 
Vv & 
n B 
u y 
t x 
4 Y 
S 4 W € 5 


1% book 


1.1.19. Determine which pairs of graphs below are isomorphic. 


Boe 


1.1.20. Determine which pairs of graphs below are isomorphic. 


# B® © 


1.1.21. Determine whether the graphs below are bipartite and whether they are iso- 
morphic. (The graph on the left appears on the cover of Wilson—Watkins [1990].) 


nN 
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1.1.22. (!) Determine which pairs of graphs below are isomorphic, presenting the proof 
by testing the smallest possible number of pairs. 


OSS © 


1.1.23. In each class below, determine the smallest n such that there exist nonisomor- 
phic n-vertex graphs having the same list of vertex degrees. 


(a) all graphs, (b) loopless graphs, (c) simple graphs. 


(Hint: Since each class contains the next, the answers form a nondecreasing triple. For 
part (c), use the list of isomorphism classes in Example 1.1.31.) 


1.1.24. (!) Prove that the graphs below are all drawings of the Petersen graph (Definition 
1.1.36). (Hint: Use the disjointness definition of adjacency.) 


ep Axed OO 


1.1.25. (!) Prove that the Petersen graph has no cycle of length 7. 


ye/math_books 


se] 


telagr 1m.1 


4 


1.1.26. (!) Let G be a graph with girth 4 in which every vertex has degree k. Prove that 
G has at least 2k vertices. Determine all such graphs with exactly 2k vertices. 


1.1.27. (!) Let G be a graph with girth 5. Prove that if every vertex of G has degree at 
least k, then G has at least k? + 1 vertices. For k = 2 and k = 8, find one such graph 
with exactly k? + 1 vertices. 


1.1.28. (+) The Odd Graph O,. The vertices of the graph O, are the k-element subsets 
of {1,2,...,2k +1}. Two vertices are adjacent if and only if they are disjoint sets. Thus 
O2 is the Petersen graph. Prove that the girth of O, is 6 ifk > 3. 


1.1.29. Prove that every set of six people contains (at least) three mutual acquaintances 
or three mutual strangers. 


1.1.30. Let G be a simple graph with adjacency matrix A and incidence matrix M. Prove 
that the degree of v; is the ith diagonal entry in A? and in MM’. What do the entries in 
position (i, 7) of A? and MM’ say about G? 


1.1.31. (!) Prove that a self-complementary graph with n vertices exists if and only ifn 
or n — 1 is divisible by 4. (Hint: When n is divisible by 4, generalize the structure of P, 
by splitting the vertices into four groups. For n = 1 mod 4, add one vertex to the graph 
constructed for n — 1.) 


1.1.32. Determine which bicliques decompose into two isomorphic subgraphs. 


1.1.33. For n =5,n =7, andn = 9, decompose K,, into copies of C,. 
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1.1.34. (!) Decompose the Petersen graph into three connected subgraphs that are pair- 
wise isomorphic. Also decompose it into copies of P,. 


1.1.35. (!) Prove that K, decomposes into three pairwise-isomorphic subgraphs if and 
only if n + 1 is not divisible by 3. (Hint: For the case where n is divisible by 3, split the 
vertices into three sets of equal size.) 


1.1.36. Prove that if K, decomposes into triangles, then n — 1 or n — 3 is divisible by 6. 


1.1.87. Let G be a graph in which every vertex has degree 3. Prove that G has no 
decomposition into paths that each have at least 5 vertices. 


1.1.38. (!) Let G be a simple graph in which every vertex has degree 3. Prove that G 
decomposes into claws if and only if G is bipartite. 


1.1.39. (+) Determine which of the graphs in Example 1.1.35 can be used to form a 
decomposition of Kg into pairwise-isomorphic subgraphs. (Hint: Each graph that is not 
excluded by some divisibility condition works.) 


1.1.40. («) Count the automorphisms of P,, C,, and K,. 


1.1.41. (*) Construct a simple graph with six vertices that has only one automorphism. 
Construct a simple graph that has exactly three automorphisms. (Hint: Think of a 
rotating triangle with appendages to prevent flips.) 


1.1.42. (*) Verify that the set of automorphisms of G has the following properties: 
a) The composition of two automorphisms is an automorphism. 
b) The identity permutation is an automorphism. 
c) The inverse of an automorphism is also an automorphism. 
d) Composition of automorphisms satisfies the associative property. 
(Comment: Thus the set of automorphisms satisfies the defining properties for a group.) 


1.1.43. («) Automorphisms of the Petersen graph. Consider the Petersen graph as de- 
fined by disjointness of 2-sets in {1,2,3,4,5}. Prove that every automorphism maps 
the 5-cycle with vertices 12, 34,51, 23,45 to a 5-cycle with vertices ab, cd, ea, bc, de 
determined by a permutation of {1, 2,3, 4,5} taking elements 1,2,3,4,5 to a, b,c, d,e, 
respectively. (Comment: This implies that there are only 120 automorphisms.) 


1.1.44. (*) The Petersen graph has even more symmetry than vertex-transitivity. Let 
P = (Uo, Uj, U2, U3) and Q = (vp, v1, ve, v3) be paths with three edges in the Petersen 
graph. Prove that there is exactly one automorphism of the Petersen graph that maps 
u; into v; for i = 0, 1, 2, 3. (Hint: Use the disjointness description.) 


1.1.45. (*) Construct a graph with 12 vertices in which every vertex has degree 3 and 
the only automorphism is the identity. 


1.1.46. (*) Edge-transitivity. A graph G is edge-transitive if for all e, f € E(G) there 
is an automorphism of G that maps the endpoints of e to the endpoints of f (in either or- 
der). Prove that the graphs of Exercise 1.1.21 are vertex-transitive and edge-transitive. 
(Comment: Complete graphs, bicliques, and the Petersen graph are edge-transitive.) 


1.1.47. («) Edge-transitive versus vertex-transitive. 

a) Let G be obtained from K, with n > 4 by replacing each edge of K, with a path 
of two edges through a new vertex of degree 2. Prove that G is edge-transitive but not 
vertex-transitive. 

b) Suppose that G is edge-transitive but not vertex-transitive and has no vertices 
of degree 0. Prove that G is bipartite. 

c) Prove that the graph in Exercise 1.1.6 is vertex-transitive but not edge-transitive. 
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1.2. Paths, Cycles, and Trails 


In this section we return to the Kénigsberg Bridge Problem, determining 
when it is possible to traverse all the edges of a graph. We also we develop 
useful properties of connection, paths, and cycles. 

Before embarking on this, we review an important technique of proof. Many 
statements in graph theory can be proved using the principle of induction. 
Readers unfamiliar with induction should read the material on this proof tech- 
nique in Appendix A. Here we describe the form of induction that we will use 
most frequently, in order to familiarize the reader with a template for proof. 


1.2.1. Theorem. (Strong Principle of Induction). Let P(n) be a statement with 
an integer parameter n. If the following two conditions hold, then P(n) is 
true for each positive integer n. 

1) P(1) is true. 
2) For all n > 1, “P(k) is true for l< k <n” implies “P(n) is true”. 


Proof: We ASSUME the Well Ordering Property for the positive integers: 
every nonempty set of positive integers has aleast element. Given this, suppose 
that P(n) fails for some n. By the Well Ordering Property, there is a least n such 
that P(n) fails. Statement (1) ensures that this value cannot be 1. Statement 
(2) ensures that this value cannot be greater than 1. The contradiction implies 
that P(n) holds for every positive integer n. | 


In order to apply induction, we verify (1) and (2) for our sequence of state- 
ments. Verifying (1) is the basis step of the proof; verifying (2) is the induction 
step. The statement “P(k) is true for all k < n” is the induction hypothesis, 
because it is the hypothesis of the implication proved in the induction step. The 
variable that indexes the sequence of statements is the induction parameter: 

The induction parameter may be any integer function of the instances of 
our problem, such as the number of vertices or edges in a graph. We say that 
we are using “induction on n” when the induction parameter is n. 

There are many ways to phrase inductive proofs. We can start at 0 to 
prove a statement for nonnegative integers. When our proof of P(n) in the 
induction step makes use only of P(n — 1) from the induction hypothesis, the 
technique is called “ordinary” induction; making use of all previous statements 
is “strong” induction. We seldom distinguish between strong induction and 
ordinary induction; they are equivalent (see Appendix A). 

Most students first learn ordinary induction in the following phrasing: 1) 
verify that P(n) is true when n = 1, and 2) prove that if P(n) is true when n is 
k, then P(n) is also true when n isk + 1. Proving P(k + 1) from P(k) fork > 1 
is equivalent to proving P(n) from P(n — 1) forn > 1. 
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When we focus on proving the statement for the parameter value n in the 
induction step, we need not decide at the outset whether we are using strong 
induction or ordinary induction. The language is also simpler, since we avoid 
introducing a new name for the parameter. In Section 1.3 we will explain why 
this phrasing is also less prone to error. 


CONNECTION IN GRAPHS 


As defined in Definition 1.1.15, paths and cycles are graphs; a path in a 
graph G is a subgraph of G that is a path (similarly for cycles). We introduce 
further definitions to model other movements in graphs. A tourist wandering 
in a city (or a Kénigsberg pedestrian) may want to allow vertex repetitions but 
avoid edge repetitions. A mail carrier delivers mail to houses on both sides of 
the street and hence traverses each edge twice. 


1.2.2. Definition. A walk is a list vo, e1, v1,..., ex, v, of vertices and edges such 
that, for 1 <i <k, the edge e; has endpoints v;_; and v;. A trail is a walk 
with no repeated edge. A u, v-walk or u, v-trail has first vertex u and last 
vertex vu; these are its endpoints.’A u, v-path is a path whose vertices of 
degree 1 (its endpoints) are u and v; the others are internal vertices. 

The length of a walk, trail, path, or cycle is its number of edges. A 
walk or trail is closed if its endpoints are the same. 


ni 


n 


1.2.8. Example. In the Kénigsberg graph (Example 1.1.1), the list x, eg, w, és, y, 
eg, X, €1, W, €2, x is a closed walk of length 5; it repeats edge eg and hence is not 
a trail. Deleting the last edge and vertex yields a trail of length 4; it repeats 
vertices but not edges. The subgraph consisting of edges ¢), e5, eg and vertices 
x,w,y is acycle of length 3; deleting one of its edges yields a path. Two edges 
with the same endpoints (such as e; and eg) form a cycle of length 2. A loop is 
a cycle of length 1. | 


The reason for listing the edges in a walk is to distinguish among multi- 
ple edges when a graph is not simple. In a simple graph, a walk (or trail) is 
completely specified by its ordered list of vertices. We usually name a path, cy- 
cle, trail, or walk in a simple graph by listing only its vertices in order, even 
though it consists of both vertices and edges. When discussing a cycle, we can 
start at any vertex and do not repeat the first vertex at the end. We can use 
parentheses to clarify that this is a cycle and not a path. 


1.2.4. Example. We illustrate the simplified notation in a simple graph. In 
the graph below, a,x, a,x,u, y,c,d, y, v, x, b,a specifies a closed walk of length 
12. Omitting the first two steps yields a closed trail. 

_ The graph has five cycles: (a,b, x), (c, y,d), (u, x,y), (x, y,v), (Uy x, v, y). 
The u, v-trail u, y,c,d, y, x, v contains the edges of the u, v-path u, y, x, v, but 
not of the u, v-path u, y, v. a 
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Suppose we follow a path from u to v in a graph and then follow a path 
from v to w. The result need not be a u, w-path, because the u, v-path and v, w- 
path may have a common internal vertex. Nevertheless, the list of vertices and 
edges that we visit does form a u, w-walk. In the illustration below, the u, w- 
walk contains a u, w-path. Saying that a walk W contains a path P means 
that the vertices and edges of P occur as a sublist of the vertices and edges of 
W, in order but not necessarily consecutive. 


= 
l 
th ? N 
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1.2.5. Lemma. Every u, v-walk contains au, v-path. 


Proof: We prove the statement by induction on the length / of a u, v-walk W. 

Basis step: / = 0. Having no edge, W consists of a single vertex (u = v). 
This vertex is a u, v-path of length 0. ~ 

Induction step: / > 1. We suppose that the claim holds for walks of length 
less than /. If W has no repeated vertex, then its vertices and edges form a 
u,v-path. If W has a repeated vertex w, then deleting the edges and vertices 
between appearances of w (leaving one copy of w) yields a shorter u, v-walk W’ 
contained in W. By the induction hypothesis, W’ contains a u, v-path P, and 
this path P is contained in W. | 


Exercise 13b develops a shorter proof. We apply the lemma to properties 
of connection. 


1.2.6. Definition. A graph G is connected if it has a u, v-path whenever 
u, v € V(G) (otherwise, G is disconnected). If G has a u, v-path, then u is 
connected to v in G. The connection relation on V(G) consists of the 
ordered pairs (u, v) such that u is connected to v. 
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“Connected” is an adjective we apply only to graphs and to pairs of vertices 
(we never say “v is disconnected” when v is a vertex). The phrase “u is connected 
to v” is convenient when writing proofs, but in adopting it we must clarify the 
distinction between connection and adjacency: 


G has au, v-path uv € E(G) 
u and v are connected u and v are adjacent 
u is connected to v u is joined to v 


u is adjacent to v 


1.2.7. Remark. By Lemma 1.2.5, we can prove that a graph is connected by 
showing that from each vertex there is a walk to one particular vertex. 

By Lemma 1.2.5, the connection relation is transitive: if G has a u, v-path 
and a v, w-path, then G has a u, w-path. It is also reflexive (paths of length 0) 
and symmetric (paths are reversible), so it is an equivalence relation. a 


Our next definition leads us to the equivalence classes of the connection 
relation. A maximal connected subgraph of G is a subgraph that is connected 
and is not contained in any other connected subgraph of G. 


1.2.8. Definition. The components of a graph G are its maximal connected 
subgraphs. A component (or graph) is trivial if it has no edges; otherwise 
it is nontrivial. An isolated vertex is a vertex of degree 0. 


c 
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The equivalence classes of the connection relation on V(G) are the vertex 
sets of the components of G. An isolated vertex forms a trivial component, 
consisting of one vertex and no edge. 


1.2.9. Example. The graph below has four components, one being an isolated 
vertex. The vertex sets of the components are {p}, {g,r}, {s,t,u,v,w}, and 
{x, y, z}; these are the equivalence classes of the connection relation. | 


r Ss ouvw y 2 
VN. / 
q t P x 
1.2.10. Remark. Components are pairwise disjoint; no two share a vertex. 
Adding an edge with endpoints in distinct components combines them into one 


component. Thus adding an edge decreases the number of components by 0 or 
1, and deleting an edge increases the number of components by 0 or 1. | 


1.2.11. Proposition. Every graph with n vertices and k edgés has at least n —k 
components. 


Proof: An n-vertex graph with no edges has n components. By Remark 1.2.10, 
each edge added reduces this by at most 1, so when k edges have been added 
the number of components is still at least n — k. | 


Section 1.2: Paths, Cycles, and Trails 23 


Deleting a vertex or an edge can increase the number of components. Al- 
though deleting an edge can only increase the number of components by 1, 
deleting a vertex can increase it by many (consider the biclique Kj,,,). When 
we obtain a subgraph by deleting a vertex, it must be a graph, so deleting the 
vertex also deletes all edges incident to it. 


1.2.12. Definition. A cut-edge or cut-vertex of a graph is an edge or vertex 
whose deletion increases the number of components. We write G — e or 
G — M for the subgraph of G obtained by deleting an edge e or set of edges 
M. We write G — v or G — S for the subgraph obtained by deleting a vertex 
v or set of vertices S. An induced subgraph is a subgraph obtained by 
deleting a set of vertices. We write G[T] for G — T, where T = V(G) —T; 
this is the subgraph of G induced by T. 


When T C V(G), the induced subgraph G[T] consists of T and all edges 
whose endpoints are contained in 7. The full graph is itself an induced sub- 
graph, as are individual vertices. A set S of vertices is an independent set if 
and only if the subgraph induced by it has no edges. 


1.2.13. Example. The graph of Example 1.2.9 has cut-vertices v and y. Its cut- 
édges are gr, vw, xy, and yz. (When we delete an edge, its endpoints remain.) 
This graph has C, and.Ps as subgraphs but not as induced subgraphs. The 
subgraph induced by {s, t,u, v} is a kite; the 4-vertex paths on these vertices 
are not induced subgraphs. The graph P, does occur as an induced subgraph; 
it is the subgraph induced by {s, ¢, v, w} (also by {s, u,v, w}). | 


Next we characterize cut-edges in terms of cycles. 


1.2.14. Theorem. An edge is a cut-edge if and only if it belongs to no cycle. 


Proof: Let e be an edge in a graph G (with endpoints x, y), and let H be the com- 
ponent containing e. Since deletion of e affects no other component, it suffices 
to prove that H — e is connected if and only if e belongs to a cycle. 

First suppose that H — e is connected. This implies that H — e contains an 
x, y-path, and this path completes a cycle with e. 

Now suppose that e lies in a cycle C. Choose u,v € V(H). Since H is 
connected, H has au, v-path P. If P does not contain e, then P exists in H — e. 
If P contains e, suppose by symmetry that x is between u and y on P. Since 
H — e contains a u, x-path along P, an x, y-path along C, and a y, v-path along 
P, the transitivity of the connection relation implies that H — e has a u, v-path. 
We did this for all u, v € V(H), so H — e is connected. ] 


24 Chapter 1: Fundamental Concepts 


BIPARTITE GRAPHS 


Our next goal is to characterize bipartite graphs using cycles. Characteri- 
zations are equivalence statements, like Theorem 1.2.14. When two conditions 
are equivalent, checking one also yields the other for free. 

Characterizing a class G by a condition P means proving the equivalence 
“G € Gif and only if G satisfies P”. In other words, P is both a necessary and 
a sufficient condition for membership in G. 


Sufficiency 


G € Gif G satisfies P 
G satisfies P>GeG 


Recall that a loop is a cycle of length 1; also two distinct edges with the 
same endpoints form a cycle of length 2. A walk is odd or even as its length 
is odd or even. As in Lemma 1.2.5, a closed walk contains a cycle C if the 
vertices and edges of C occur as asublist of W, in cyclic order but not necessarily 
consecutive. We can think of a closed walk or a cycle as starting at any vertex; 
the next lemma requires this viewpoint. 


Necessity 


G € Gonly if G satisfies P 
G € G=> G satisfies P 


1.2.15. Lemma. Every closed odd walk contains an odd cycle. 


Proof: We use induction on the length/ of a closed odd walk W. 

Basis step: / = 1. A closed walk oflength 1 traverses a cycle of length 1. 

Induction step: / > 1. Assume the:claim for closed odd walks shorter than 
W. If W has no repeated vertex (other-than first = last), then W itself forms a 
cycle of odd length. If vertex v is repeated in W, then we view W as starting 
at v and break W into two v, v-walks. Since W has odd length, one of these is 
odd and the other is even. The odd one is shorter than W. By the induction 
hypothesis, it contains an odd cycle, and this cycle appears in orderinW. 5 


odd even 


1.2.16. Remark. A closed even walk need not contain a cycle; it may simply 
repeat. Nevertheless, if an edge e appears exactly once in a closed walk W, then 
W does contain a cycle through e. Let x, y be the endpoints of e. Deleting e 
from W leaves an x, y-walk that avoids e. By Lemma 1.2.5, this walk contains 
an x, y-path, and this path completes a cycle with e. (See Exercises 15-16.) 


Lemma 1.2.15 will help us characterize bipartite graphs. 


1.2.17. Definition. A bipartition of G is a specification of two disjoint inde- 
pendent sets in G whose union is V(G). The statement “Let G be a bipartite 
graph with bipartition X, Y” specifies one such partition. An X, Y-bigraph 
is a bipartite graph with bipartition X, Y. 
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The sets of a bipartition are partite sets (Definition 1.1.10). A disconnected 
bipartite graph has more than one bipartition. A connected bipartite graph has 
only one bipartition, except for interchanging the two sets (Exercise 7). 


1.2.18. Theorem. sani [1936]) A graph is bipartite if and only if it has no 
odd cycle. 


Proof: Necessity. Let G be a bipartite graph. Every walk alternates between 
the two sets of a bipartition, so every return to the original partite set happens 
after an even number of steps. Hence G has no odd cycle. 

Sufficiency. Let G be a graph with no odd cycle. We prove that G is bipartite 
by constructing a bipartition of each nontrivial component. Let u be a vertex in 
a nontrivial component H. For each v € V(#), let f(v) be the minimum length 
of a u, v-path. Since H is connected, f(v) is defined for each v € V(A). 

Let X = {v € V(H): f(v) is even} and Y = {v € V(A): f(v) is odd}. An 
edge v, v’ within X or Y would create a closed odd walk using a shortest u, v- 
path, the edge vv’, and the reverse of a shortest u, v'-path. By Lemma 1.2.15, 
such a walk must contain an odd cycle, which contradicts our hypothesis. Hence 
X and Y are independent sets. Also X UY = V(H), so H is an X, Y-bigraph. @ 
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1.2.19. Remark. Testing whether a graph is bipartite. Theorem 1.2.18 implies 
that whenever a graph G is not bipartite, we can prove this statement by pre- 
senting an odd cycle in G. This is much easier than examining all possible 
bipartitions to prove that none work. When we want to prove that G is bipar- 
tite, we define a bipartition and prove that the two sets are independent; this 
is easier than examining all cycles. | 


We consider one application. 


1.2.20. Definition. The union of graphs G),..., Gx, written G,U---U Gy, is 
the graph with vertex set _)t_, V(G;) and edge set U_, E(Gi). 


1.2.21. Example. Below we show Ky, as the union of two 4-cycles. When a 
graph G is expressed as the union of two or more subgraphs, an edge of G can 
belong to many of them. This distinguishes union from decomposition, where 


each edge belongs to only one subgraph in the list. | 
01 11 
00 10 
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1.2.22. Example. Consider an air traffic system with k airlines. Suppose that 
1) direct service between two cities means round-trip direct service, and 
2) each pair of cities has direct service from at least one airline. 
Suppose also that no airline can schedule a cycle through an odd number of 
cities. In terms of k, what is the maximum number of cities in the system? 
By Theorem 1.2.18, we seek the largest n such that K,, can be expressed as 
the union of k bipartite graphs, one for each airline. The answer is 2*. | 


1.2.23. Theorem. The complete graph K,, can be expressed as the union of k 
bipartite graphs if and only if n < 2*. 


Proof: We use induction on k. Basis step: k = 1. Since K3 has an odd cycle and 
Ko does not, K,, is itself a bipartite graph if and only ifn < 2. 

Induction step: k > 1. We prove each implication using the induction 
hypothesis. Suppose first that K, = G; U---U G,;, where each G; is bipartite. 
We partition the vertex set into two sets X, Y such that G, has no edge within 
X or within Y. The union of the other k — 1 bipartite subgraphs must cover the 
complete subgraphs induced by X and by Y. Applying the induction hypothesis 
to each yields |X| < 2‘! and |Y| < 24-1, son < 24-14 2-1 = 2k. 

Conversely, suppose that n < 2‘. We partition the vertex set into subsets 
X,Y, each of size at most 2‘, By the induction hypothesis, we can cover the 
complete subgraph induced by either subset with k — 1 bipartite subgraphs. 
The union of the ith such subgraph on:X with the ith such subgraph on Y is a 
bipartite graph. Hence we obtain k — 1 bipartite graphs whose unioz consists 
of the complete subgraphs induced by X and Y. The remaining edges are those 
of the biclique with bipartition X, Y. Letting this be the kth bipartite subgraph 
completes the construction. . | 


This theorem can also be proved without induction by encoding the vertices 
as binary k-tuples (Exercise 31). 


EULERIAN CIRCUITS 


We return to our analysis of the Kénigsberg Bridge Problem. What the 
people of Kénigsberg wanted was a closed trail containing all the edges in a 
graph. As we have observed, a necessary condition for existence of such a trail 
is that all vertex degrees be even. Also it is necessary that all edges belong to 
the same component of the graph. 

The Swiss mathematician Leonhard Euler (pronounced “oiler”) stated 
[1736] that these conditions are also sufficient. In honor of his contribution, 
we associate his name with such graphs. Euler’s paper appeared in 1741 but 
gave no proof that the obvious necessary conditions are sufficient. Hierholzer 
[1873] gave the first complete published proof. The graph we drew in Example 
1.1.1 to model the city did not appear in print until 1894 (see Wilson [1986] for 
a discussion of the historical record). 
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1.2.24. Definition. A graph is Eulerian if it has a closed trail containing all 
edges. We call a closed trail a circuit when we do not specify the first ver- 
tex but keep the list in cyclic order. An Eulerian circuit or Eulerian 
trail in a graph is a circuit or trail containing all the edges. 

An even graph is a graph with vertex degrees all even. A vertex is 
odd [even] when its degree is odd [even]. 


Our discussion of Eulerian circuits applies also to graphs with loops; we 
extend the notion of vertex degree to graphs with loops by letting each loop 
contribute 2 to the degree of its vertex. This does not change the parity of 
the degree, and the presence of a loop does not affect whether a graph has an 
Eulerian circuit unless it is a loop in a component with one vertex. 

Our proof of the characterization of Eulerian graphs uses a lemma. A 
maximal path in a graph G is a path P in G that is not contained in a longer 
path. When a graph is finite, no path can extend forever, so maximal (non- 
extendible) paths exist. 


1.2.25. Lemma. If every vertex of a graph G has degree at least 2, then G 
contains a cycle. 


Proof: Let P be a maximal path in G,and let u be an endpoint of P. Since P 
cannot be extended, every neighbor of'u must already be a vertex of P. Since 
u has degree at least 2, it has a neighbor v in V(P) via an edge not in P. The 
edge uv completes a cycle with the portion of P from v to u. | 


= 
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Note the importance of finiteness. If V(G) = Z and .E(G) = {ij: |i — j| = 1}, 
then every vertex of G has degree 2, but G has no cycle (and no non-extendible 
path). We avoid such examples by assuming that all graphs in this book are 
finite, with rare explicit exceptions. 


1.2.26. Theorem. A graph G is Eulerian if and only if it has at most one 
nontrivial component and its vertices all have even degree. 


Proof: Necessity. Suppose that G has an Eulerian circuit C. Each passage of C 
through a vertex uses two incident edges, and the first edge is paired with the 
last at the first vertex. Hence every vertex has even degree. Also, two edges 
can be in the same trail only when they lie in the same component, so there is 
at most one nontrivial component. 

Sufficiency. Assuming that the condition holds, we obtain an Eulerian 
circuit using induction on the number of edges, m. 

Basis step: m = 0. A closed trail consisting of one vertex suffices. 

Induction step: m > 0. With even degrees, each vertex in the nontrivial 
component of G has degree at least 2. By Lemma 1.2.25, the nontrivial compo- 
nent has a cycle C. Let G’ be the graph obtained from G by deleting E(C). 
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Since C has 0 or 2 edges at each vertex, each component of G’ is also an 
even graph. Since each component also is connected and has fewer than m 
edges, we can apply the induction hypothesis to conclude that each component 
of G’ has an Eulerian circuit. To combine these into an Eulerian circuit of G, 
we traverse C, but when a component of G’ is entered for the first time we 
detour along an Eulerian circuit of that component. This circuit ends at the 
vertex where we began the detour. When we complete the traversal of C, we 
have completed an Eulerian circuit of G. | 


Perhaps as important as the characterization of Eulerian graphs is what 
the method of proof says about even graphs. 


1.2.27. Proposition. Every even graph decomposes into cycles. 
Proof: In the proof of Theorem 1.2.26, we noted that every even nontrivial 


graph has a cycle, and that the deletion of a cycle leaves an even graph. Thus 
this proposition follows by induction on the number of edges. | 


I 


n 


In the characterization of Eulerian circuits, the necessity of the condition 
is easy to see. This also holds for the characterization of bipartite graphs by 
absence of odd cycles and for many other characterizations. Nash-Williams and 
others popularized a mnemonic for such theorems: TONCAS, meaning “The 
Obvious Necessary Conditions are Also Sufficient”. 

The proof of Lemma 1.2.25 is an example of an important technique of 
proof in graph theory that we call extremality. When considering structures 
of a given type, choosing an example that is extreme in some sense may yield 
useful additional information. For example, since a maximal path P cannot be 
extended, we obtain the extra information that every neighbor of an endpoint 
- of P belongs to V(P). 

In a sense, making an extremal choice goes directly to the important case. 
In Lemma 1.2.25, we could start with any path. If it is extendible, then we ex- 
tend it. If not, then something important happens. We illustrate the technique 
with several examples, and Exercises 37-42 also use extremality. We begin by 
strengthening Lemma 1.2.25 for simple graphs. 


1.2.28. Proposition. If G is a simple graph in which every vertex has degree 
at least k, then G contains a path of length at least k. Ifk > 2, then G also 
contains a cycle of length at least k + 1. 


Proof: Let u be an endpoint of a maximal path P in G. Since P does not extend, 
every neighbor of u is in V(P). Since u has at least k neighbors and G is simple, 
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P therefore has at least k vertices other than u and has length at least k. If 
k > 2, then the edge from u to its farthest neighbor v along P completes a 
sufficiently long cycle with the portion of P from v to u. a 


os. oe 


Uu Vv 


1.2.29. Proposition. Every graph with a nonloop edge has at least two vertices 
that are not cut-vertices. 


Proof: If u is an endpoint of a maximal path P in G, then the neighbors of u lie 
on P. Since P — u is connected in G — u, the neighbors of u belong to a single 
component of G — u, and u is not a cut-vertex. | 


1.2.30. Remark. Note the difference between “maximal” and “maximum”. 
As adjectives, maximum means “maximum-sized”, and maximal means “no 
larger one contains this one”. Every maximum path is a maximal path, but 
maximal paths need not have maximum length. Similarly, the biclique K,., has 
two maximal independent sets, but when r # s it has only one maximum inde- 
pendent set. When describing numbers rather than containment, the meanings 
are the same; maximum vertex degree = maximal vertex degree. 

Besides maximal or maximum paths or independent sets, other extremal 
aspects include vertices of minimum or maximum degree, the first vertex where 
two paths diverge, maximal connected:subgraphs (components), etc. In a con- 
nected graph G with disjoint sets S$, 7 C V(G), we can obtain a path from S to 
T having only its endpoints in S UT by choosing a shortest path from S to 7; 
Exercise 40 applies this. Exercise 37 uses extremality for a short proof of the 
transitivity of the connection relation. a 


Many proofs using induction can be phrased'using extremality, and many 
proofs using extremality can be done by induction. To underscore the interplay, 
we reprove the characterization of Eulerian graphs using extremality directly. 


1.2.31. Lemma. In an even graph, every maximal trail is closed. 


Proof: Let T be a maximal trail in an even graph. Every passage of 7 through 
a vertex v uses two edges at v, none repeated. Thus when arriving at a vertex 
v other than its initial vertex, T has used an odd number of edges incident to 
v. Since v has even degree, there remains an edge on which T can continue. 
Hence 7 can only end at its initial vertex. In a finite graph, 7 must indeed 
end. We conclude that a maximal trail must be closed. | 


1.2.32. Theorem 1.2.26—Second Proof. We prove TONCAS. In a graph G 
satisfying the conditions, let T be a trail of maximum length; T must also be a 
maximal trail. By Lemma 1.2.31, T is closed. 

Suppose that T omits some edge e of G. Since G has only one nontrivial 
component, G has a shortest path from e to the vertex set of 7. Hence some 
edge e’ not in T is incident to some vertex v of T. 
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Since 7 is closed, there-is a trail T’ that starts and ends at v and uses the 
same edges as T. We now extend 7” along e’ to obtain a longer trail than T. 
This contradicts the choice of 7, and hence 7 traverses all edges of G. a 


This proof and the resulting construction procedure (Exercise 12) are sim- 
ilar to those of Hierholzer [1873]. Exercise 35 develops another proof. 

Later chapters contain several applications of the statement that every 
connected even graph has an Eulerian circuit. Here we give a simple one. 
Wher drawing a figure G on paper, how many times must we stop and move 
the pen? We are not allowed to repeat segments of the drawing, so each visit 
to the paper contributes a trail. Thus we seek a decomposition of G into the 
minimum number of trails. We may reduce the problem to connected graphs, 
since the number of trails needed to draw G is the sum of the number needed 
to draw each component. 

For example, the graph G below has four odd vertices and decomposes into 
two trails. Adding the dashed edges on the right makes it Eulerian. 


5 


1.2.33. Theqrem. For a connected nontrivial graph with exactly 2k odd ver- 
tices, the minimum number of trails that decompose it is max{k, 1}. 
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Proof: A trail contributes even degree to every vertex, except that a non-closed 
trail contributes odd degree to its endpoints. Therefore, a partition of the edges 
into trails must have some non-closed trail ending at each odd vertex. Since 
each trail has only two ends,’we must use at least k trails to satisfy 2k odd 
vertices. We also need at least one trail since G has an edge, and Theorem 
1.2.26 implies that one trail suffices when k = 0. 

It remains to prove that k trails suffice when k > 0. Given such a graph G, 
we pair up the odd vertices in G (in any way) and form G’ by adding for each 
pair an edge joining its two vertices, as illustrated above. The resulting graph 
G’ is connected and even, so by Theorem 1.2.26 it has an Eulerian circuit C. As 
we traverse C in G’, we start a new trail in G each time we traverse an edge of 
G' — E(G). This yields k trails decomposing G. | 


We prove theorems in general contexts to avoid work. The proof of Theorem 
1.2.33 illustrates this; by transforming G into a graph where Theorem 1.2.26 
applies, we avoid repeating the basic argument of Theorem 1.2.26. Exercise 33 
requests a proof of Theorem 1.2.33 directly by induction. 

Note that Theorem 1.2.33 considers only graphs having an even number of 
vertices of odd degree. Our first result in the next section explains why. 
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EXERCISES 


Most problems in this book require proofs. Words like “construct”, “show”, “ob- 
tain”, “determine”, etc., explicitly state that proof is required. Disproof by providing a 
counterexample requires confirming that it is a counterexample. 


1.2.1. (—) Determine whether the statements below are true or false. 
a) Every disconnected graph has an isolated vertex. 
b) A graph is connected if and only if some vertex is connected to all other vertices. 
c) The edge set of every closed trail can be partitioned into edge sets of cycles. 
d) If a maximal trail in a graph is not closed, then its endpoints have odd degree. 


1.2.2. (—) Determine whether K, contains the following (give an example or a proof of 
non-existence). 

a) A walk that is not a trail. 

b) A trail that is not closed and is not a path. 

c) A closed trail that is not a cycle. 


1.2.3. (—) Let G be the graph with vertex set {1,..., 15} in which i and j are adjacent 
if and only if their greatest common factor exceeds 1. Count the components of G and 
determine the maximum length of a path in G. 


1.2.4. (—) Let G be a graph. For v € V(G) and e € E(G), describe the adjacency and 
incidence matrices of G — v and G — ¢ in terms of the corresponding matrices for G. 


1.2.5. (—) Let v be a vertex of a connected ‘simple graph G. Prove that v has a neighbor 
in every component of G — v. Conclude that no graph has a cut-vertex of degree 1. 


1.2.6. (—) In the graph below (the paw), find all the maximal paths, maximal cliques, 
and maximal independent sets. Also find all the maximum paths, maximum cliques, 
and maximum independent sets. 2 
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1.2.7. (—) Prove that a bipartite graph has a unique bipartition (except for interchang- 
ing the two partite sets) if and only if it is connected. 


1.2.8. (—) Determine the values of m and n such that K,,., is Eulerian. 


1.2.9. (—) What is the minimum number of trails needed to decompose the Petersen 
graph? Is there a decomposition into this many trails using only paths? 


1.2.10. (—) Prove or disprove: 

a) Every Eulerian bipartite graph has an even number of edges. 

b) Every Eulerian simple graph with an even number of vertices has an even num- 
ber of edges. 


1.2.11. (—) Prove or disprove: If G is an Eulerian graph with Aes e, f that share a 
vertex, then G has an Eulerian circuit in which e, f appear consecutively. 


1.2.12. (—) Convert the proof at 1.2.32 to an procedure for finding an Eulerian circuit 
in a connected even graph. 
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1.2.13. Alternative proofs that every u, v-walk contains a u, v-path (Lemma 1.2.5). 

a) (ordinary induction) Given that every walk of length / — 1 contains a path from 
its first vertex to its last, prove that every walk of length / also satisfies this. 

b) (extremality) Given a u, v-walk W, consider a shortest u, v-walk contained in W. 


1.2.14. Prove or disprove the following statements about simple graphs. (Comment: 
“Distinct” does not mean “disjoint”.) 

a) The union of the edge sets of distinct u, v-walks must contain a cycle. 

b) The union of the edge sets of distinct u, v-paths must contain a cycle. 


1.2.15. (!) Let W be a closed walk of length at least 1 that does not contain a cycle. 
Prove that some edge of W repeats immediately (once in each direction). 


1.2.16. Let e be an edge appearing an odd number of times in a closed walk W. Prove 
that W contains the edges of a cycle through e. 


1.2.17. (!) Let G,, be the graph whose vertices are the permutations of {1,...,}, with 
two permutations a),...,a, and b;,...,b, adjacent if they differ by interchanging a pair 
of adjacent entries (G3 shown below). Prove that G,, is connected. 


123 213 


132 231 


312 321 
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1.2.18. (!) Let G be the graph whose vertex set is the set of k-tuples with elements in 
{0,1}, with x adjacent to y if x and y differ i in exactly two positions. Determine the 
number of components of G. 


tele 


1.2.19. Let r and s be natural numbers. Let G be the simple graph with vertex set 
Uo, --+, U,-1 Such that v; — v; if and only if |j —i| € {r,s}. Prove that S has exactly k 
components, where k is the greatest common divisor of {n, r, 5}. 


1.2.20. (!) Let v be a cut-vertex of a simple graph G. Prove that G — v is connected. 


1.2.21. Let G be a self-complementary graph. Prove that G has.a cut-vertex if and only 
if G has a vertex of degree 1. (Akiyama—Harary [1981]) 


1.2.22. Prove that a graph is connected if and only if for every partition of its vertices 
into two nonempty sets, there is an edge with endpoints in both sets. 


1.2.23. For each statement below, determine whether it is true for every connected 
simple graph G that is not a complete graph. 

a) Every vertex of G belongs to an induced subgraph isomorphic to P3. 

b) Every edge of G belongs to an induced subgraph isomorphic to P3. 


1.2.24, Let G be a simple graph having no isolated vertex and no induced subgraph 
with exactly two edges. Prove that G is a complete graph. 


1.2.25. (!) Use ordinary induction on the number of edges to prove that absence of odd 
cycles is a sufficient condition for a graph to be bipartite. 


1.2.26. (!) Prove that a graph G is bipartite if and only if every subgraph H of G has an 
independent set consisting of at least half of V(H). 
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1.2.27. Let G, be the graph whose vertices are the permutations of {1,..., n}, with two 
permutations a,...,d@, and by,...,b, adjacent if they differ by switching two entries. 
Prove that G,, is bipartite (G3 shown below). (Hint: For each permutation a, count the 
pairs i, j such that i < j and a; > a;; these are called inversions.) 


123 213 


132 231 


312 321 


1.2.28. (!) In each graph below, find a bipartite subgraph with the maximum number of 
edges. Prove that this is the maximum, and determine whether this is the only bipartite 
subgraph with this many edges. 


a= OO 


1.2.29. (!) Let G be a connected simple graph not having P, or C3 as an induced sub- 
graph. Prove that G is a biclique (complete bipartite graph). 


1.2.30. Let G be a simple graph with vertices v;,..., v,. Let A‘ denote the kth power of 
the adjacency matrix of G under matrix multiplication. Prove that entry i, j of A‘ is the 
number of v;, vj-walks of length k in G. Prove that G is bipartite if and only if, for the 
odd integer r nearest to n, the diagonal entries of A’ are all 0. (Reminder: A walk is an 
ordered list of vertices and edges.) 2 


1.2.31. (!) Non-inductive proof of Theorem 1.2.23 (see Example 1.2.21). 

a) Given n < 2*, encode the vertices of K, as distinct binary k-tuples. Use this to 
construct k bipartite graphs whose union is K,,. 

b) Given that K, is a union of bipartite graphs Gj, ..., G,, encode the vertices of K,, 
as distinct binary k-tuples. Use this to prove that n < 2*. 


1.2.32. The statement below is false. Add a hypothesis to correct it, and prove the 
corrected statement. 
“Every maximal trail in an even graph is an Eulerian circuit.” 


1.2.33. Use ordinary induction on k or on the number of edges (one by one) to prove 
that a connected graph with 2k-odd vertices decomposes into k trails if k > 0. Does this 
remain true without the connectedness hypothesis? 


1.2.34. Two Eulerian circuits are equivalent if they have the same unordered pairs of 
-consecutive edges, viewed cyclically (the starting point and direction are unimportant). 
A cycle, for example, has only one equivalence class of Eulerian circuits. How many 
equivalence classes of Eulerian circuits are there in the graph drawn below? 
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1.2.35. Tucker’s Algorithm. Let G be a connected even graph. At each vertex, partition 
the incident edges into pairs (each edge appears in a pair for each of its endpoints). 
Starting along a given edge e, form a trail by leaving each vertex along the edge paired 
with the edge just used to enter it, ending with the edge paired with e. This decomposes 
G into closed trails. As long as there is more than one trail in the decomposition, find 
two trails with a common vertex and combine them into a longer trail by changing the 
pairing at a common vertex. Prove that this procedure works and produces an Eulerian 
circuit as its final trail. (Tucker [1976]) 


1.2.36. (+) Alternative characterization of Eulerian graphs. 

a) Prove that if G is Eulerian and G’ = G — uv, then G’ has an odd number of u, v- 
trails that visit v only at the end. Prove also that the number of the trails in this list 
that are not paths is even. (Toida [1973]) 

b) Let v be a vertex of odd degree in a graph. For each edge e incident to v, let c(e) 
be the number of cycles containing e. Use ». c(e) to prove that c(e) is even for some e 
incident to v. (McKee [1984]) 

c) Use part (a) and part (b) to conclude that a nontrivial connected graph is Eulerian 
if and only if every edge belongs to an odd number of cycles. 


1.2.37. (!) Use extremality to prove that the connection relation is transitive. (Hint: 
Given au, v-path P and a v, w-path Q, consider the first vertex of P in Q.) 


1.2.38. (!) Prove that every n-vertex graph with at least n edges contains a cycle. 


1.2.39. Suppose that every vertex of a loopless graph G has degree at least 3. Prove 
that G has a cycle of even lengin. (Hint: Consider a maximal path.) (P. Kwok) 


1.2.40. (!) Let P and Q be paths of maximum length in a connected graph G. Prove 
that P and Q have a common vertex. 


1.2.41. Let G be a connected graph with: at least three vertices. Prove that G has 
two vertices x, y such that 1) G — {x, y} is connected and 2) x, y are adjacent or have a 
common neighbor. (Hint: Consider a longest path.) (Chung [1978a]) 
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1.2.42. Let G be a connected simple graph that does not have P, or C, as an induced 
subgraph. Prove that G has a vertex adjacent to all other vertices. (Hint: Consider a 
vertex of maximum degree.) (Wolk [1965]) 


1.2.43. (+) Use induction on k to prove that every connected simple graph with an even 
number of edges decomposes into paths of length 2. Does the conclusion remain true if 
the hypothesis of connectedness is omitted? 


1.3. Vertex Degrees and Counting 


The degrees of the vertices are fundamental parameters of a graph. We 
repeat the definition in order to introduce important notation. 


1.3.1. Definition. The degree of vertex v in a graph G, written dg(v) or d(v), 
is the number of edges incident to v, except that each loop at v counts twice. 
The maximum degree is A(G), the minimum degree is 5(G), and G is reg- 
ular if A(G) = 6(G). It is k-regular if the common degree is k. The neigh- 
borhood of v, written Ng(v) or N(v), is the set of vertices adjacent to v. 
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1.3.2. Definition. The order of a graph G, written n(G), is the number of 
vertices in G. An n-vertex graph is a graph of order n. The size ofa 
graph G, written e(G), is the number of edges in G. For n € N, the notation 
[n] indicates the set {1,..., n}. 


Since our graphs are finite, n(G) and e(G) are well-defined nonnegative 
integers. We also often use “e” by itself to denote an edge. When e denotes a 
particular edge, it is not followed by the name of a graph in parentheses, so 
the context indicates the usage. We have used “n-cycle” to denote a cycle with 
n vertices; this is consistent with “n-vertex graph”. 


COUNTING AND BIJECTIONS 


We begin with counting problems about subgraphs in a graph. The first 
such problem is to count the edges; we do this using the vertex degrees. The 
resulting formula is an essential tool of graph theory, sometimes called the 
“First Theorem of Graph Theory” or the “Handshaking Lemma”. 


1.3.3. Proposition. (Degree-Sum Formula) If G is a graph, then 


eviG) dv) = 2e(G). 
Proof: Summing the degrees counts each edge twice, since each edge has two 
ends and contributes to the degree at ach endpoint. | 
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The proof holds even when G has = since a loop contributes 2 to the 
degree of its endpoint. For a loopless graph, the two sides of the formula count 
the set of pairs (v, e) such that v is an endpoint of e, grouped by vertices or 
grouped by edges. “Counting two ways” is an elegant technique for proving 
integer identities (see Exercise 31 and Appendix A). 

The degree-sum formula has several immediate corollaries. Corollary 1.3.5 
applies in Exercises 9-13 and in many arguments of later chapters. 


1.3.4. Corollary. In a graph G, the average vertex degree is a , and hence 
5(G) < #2 < ACG). i 
1.3.5. Corollary. Every graph has an even number of vertices of odd degree. 
No graph of odd order is regular with odd degree. a 
1.3.6. Corollary. A k-regular graph with n vertices has nk/2 edges. | 


We next introduce an important family of graphs. 


1.3.7. Definition. The k-dimensional cube or hypercube (Q, is the sim- 
ple graph whose vertices are the k-tuples with entries in {0, 1} and whose 
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edges are the pairs of k-tuples that differ in exactly one position. A j/- 
dimensional subcube of Q,; is a subgraph of Q; isomorphic to Q;. 


011 111 
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Above we show Q3. The hypercube is a naturai computer architecture. 
Processors can communicate directly if they correspond to adjacent vertices in 
Q,. The k-tuples that name the vertices serve as addresses for the processors. 


1.3.8. Example. Structure of hypercubes. The parity of a vertex in Q, is the 
parity of the number of 1s in its name, even or odd. Each edge of Q; has an 
even vertex and an odd vertex as endpoints. Hence the even vertices form an 
independent set, as do the odd vertices, and Q, is bipartite. 

Each position in the k-tuples can be specified in two ways, son(Q;) = 2‘. A 
neighbor of a vertex is obtained by changing one of the k positions in its name 
to the other value. Thus Q, is k-regular. By Corollary 1.3.6, e(Q,) = k2*"). 

The bold edges above show two subgraphs of Q3 isomorphic to Qe, formed 
by keeping the last coordinate fixed at-0 or at 1. We can form a j-dimensional 
subcube by keeping any k — j coordinates fixed and letting the values in the 
remaining j coordinates range over all 2/ possible j-tuples. The subgraph in- 
duced by such a set of vertices is isomorphic to Q;. Since there are (‘) ways 


to pick j coordinates to vary and 2‘~/ ways to specify the values in the fixed 
coordinates, this specifies (j)2k-3 such subcubes. In fact, there are no other 
j-dimensional subcubes (Exercise 29). 

The copies of Q; are simply the edges in Q,. Our formula reduces to k2‘—} 
when j = 1, so we have found another counting argument to compute e(Q,). 

When j = k — 1, our discussion suggests a recursive description of Q,. 
Append 0 to the vertex names in a copy of Q;-1; append 1 in another copy. 
Add edges joining vertices from the two copies whose first k — 1 coordinates are 
equal. The result is Q,. The basis of the construction is the 1-vertex graph Qo. 
This description leads to inductive proofs for many properties of hypercubes, 
including e(Q,) = k2‘~! (Exercise 23). a 


A hypercube is a regular bipartite graph. A simple counting argument 
proves a fundamental observation about such graphs. 


1.3.9. Proposition. If k > 0, then a k-regular bipartite graph has the same 
number of vertices in each partite set. 
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Proof: Let G be an X, Y-bigraph. Counting the edges according to their end- 
points in X yields e(G) = k |X|. Counting them by their endpoints in Y yields 
e(G) =k |Y|. Thus k |X| = k|Y|, which yields |X| = |Y| when k > 0. | 


Another technique for counting a set is to establish a bijection from it to 
a set of known size. Our next example uses this approach. Other examples of 
combinatorial arguments for counting problems appear in Appendix A. Exer- 
cises 18—35 involve counting. 


1.3.10. Example. The Petersen graph has ten 6-cycles. Let G be the Petersen 
graph. Being 3-regular, G has ten claws (copies of K1.3). We establish a one-to- 
one correspondence between the 6-cycles and the claws. 

Since G has girth 5, every 6-cycle F is an induced subgraph. Each vertex 
of F has one neighbor outside F. Since nonadjacent vertices have exactly one 
common neighbor (Proposition 1.1.38), opposite vertices on F have a common 
neighbor outside F. Since G is 3-regular, the resulting three vertices outside 
F are distinct. Thus deleting.V(F) leaves a subgraph with three vertices of 
degree 1 and one vertex of degree 3; it is a claw. 


We show that each claw H in G arises exactly once in this way. Let S be the 
set of vertices with degree 1 in H; S is an independent set. The central vertex 
of H is already a common neighbor, so the six other edges from S reach distinct 


vertices. Thus G — V(H) is 2-regular. Since G has girth 5, G — V(H) must be a 
6-cycle. This 6-cycle yields H when its vertices are deleted. a 


We present one more counting argument related to a long-standing conjec- 
ture. Subgraphs obtained by deleting a single vertex are called vertex-deleted 
subgraphs. These subgraphs need not all be distinct; for example, the n vertex- 
deleted subgraphs of C,, are all isomorphic to P,,_1. 


1.3.11.* Proposition. For a simple graph G with vertices v;,..., v, andn > 3, 
Le(G — i Le(G — i 
ios aad ee. 
n—2 n—-2 


Proof: An edge e of G appears in G — v; if and only if v; is not an endpoint of e. 
Thus )'(G — v;) counts each edge exactly n — 2 times. 

Once we know e(G), the degree of v; can be computed as the number of 
edges lost when deleting v; to form G — vj. a 
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Typically, we are given the vertex-deleted subgraphs as unlabeled graphs; 
we know only the list of isomorphism classes, not which vertex of G — v; cor- 
responds to which vertex in G. This can make it very difficult to tell what G 
is. For example, Kz and its complement have the same list of vertex-deleted 
subgraphs. For larger graphs we have the Reconstruction Conjecture, for- 
mulated in 1942 by Kelly and Ulam. 


1.3.12.* Conjecture. (Reconstruction Conjecture) If G is a simple graph with 
at least three vertices, then G is uniquely determined by the list of (iso- 
morphism classes of) its vertex-deleted subgraphs. a 


The list of vertex-deleted subgraphs of G has n(G) items. Proposition 1.3.11 
shows that e(G) and the list of vertex degrees can be reconstructed. The lat- 
ter implies that regular graphs can be reconstructed (Exercise 37). We can 
also determine whether G is connected (Exercise 38); using this, disconnected 
graphs can be reconstructed (Exercise 39). Other sufficient conditions for re- 
constructibility are known, but the general conjecture remains open. 


EXTREMAL PROBLEMS 


1_books 


An extremal problem asks for the maximum or minimum value of a func- 
tion over a class of objects. For example, the maximum number of edges in a 
simple graph with n vertices is (5). 


telegram. 


1.3.13. Proposition. The minimum number of edges in a connected graph 
with n vertices is n — 1. 


Proof: By Proposition 1.2.11, every graph with n vertices and k edges has at 
least n—k components. Hence every n-vertex graph with fewer than n — 1 edges 
has at least two components and is disconnected. The contrapositive of this is 
that every connected n-vertex graph has at least n — 1 edges. This lower bound 
is achieved by the path P,,. | 


1.3.14. Remark. Proving that £ is the minimum of {(G) for graphs in a class 
G requires showing two things: 

1) f(G) > B forallG eG. 

2) f(G) = B for some G € G. 
The proof of the bound must apply to every G € G. For equality it suffices to 
obtain an example in G with the desired value of /. 

Changing “>” to “<” yields the criteria for a maximum. | 


Next we solve a maximization problem that is not initially phrased as such. 


1.3.15. Proposition. If G is a simple n-vertex graph with 5(G) > (n — 1)/2, 
then G is connected. 
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Proof: Choose u, v € V(G). It suffices to show that u, v have acommon neighbor 
if they are not adjacent. Since G is simple, we have |N(u)| > 6(G) > (n — 1)/2, 
and similarly for v. When u # v, we have |N(u) U N(v)| < n — 2, since u and v 
are not in the union. Using Remark A.13 of Appendix A, we thus compute 


IN(u) A N(v)| = |N(w)| + |N(v)| — INC) U N(v)| = 354 +454 -(@-2)=1. © 


We say that a result is best possible or sharp when there is some aspect 
of it that cannot be strengthened without the statement becoming false. As 
shown by the next example, this holds for Proposition 1.3.15; when 6(G) is 
smaller than (n(G) — 1)/2, we cannot still conclude that G must be connected. 


1.3.16. Example. Let G be the n-vertex graph with components isomorphic to 
K\n/2| and Kjn/2;, where the floor |x| of x is the largest integer at most x and 
the ceiling [x] of x is the smallest integer at least x. Since 6(G) = |n/2| —1 
and G is disconnected, the inequality in Proposition 1.3.15 is sharp. 

We use the floor and ceiling functions here in order to describe a single 
family of graphs providing an example for each n. | 


K\n/2! ) Kin/2] 
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Ry providing a family of examples-to show that the bound is best possible, 
we have solved an extremal problem. Together, Proposition 1.3.15 and Example 
1.3.16 prove “The minimum value of 5(G) that forces an n-vertex simple graph 
G to be connected is [n/2],” or “The maximum value of 6(G) among disconnected 
n-vertex simple graphs is |n/2]| — 1.” 

We introduce compact notation to describe the graph of Example 1.3.16. 


1.3.17. Definition. The graph obtained by taking the union of graphs G and 
H with disjoint vertex sets is the disjoint union or sum, written G + H. 
In general, mG is the graph consisting of m pairwise disjoint copies of G. 


1.3.18. Example. If G and H are connected, then G+ H has components G and 
H, so the graph in Example 1.3.16 is K,/2) + K[n/2}. This notation is convenient 
when we have not named the vertices. Note that K, + K, = Kmn-- 

The graph mKg consists of m pairwise disjoint edges. | 


In graph theory, we use “extremal problem” for finding an optimum over a 
class of graphs. When seeking extremes in a single graph, such as the maxi- 
mum size of an independent set, or maximum size of a binartite subgraph, we 
have a different problem for each graph. To distinguish these from the earlier 
type of problem, we call them optimization problems. 

Since an optimization problem has an instance for each graph, we usualiy 
can’t list all solutions. We may seek a solution procedure or bounds on the 
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answer in terms of other aspects of the input graph. In this light we consider 
the problem of finding a large bipartite subgraph. It allows us to introduce the 
technique of constructive or “algorithmic” proof. (An algorithm is a procedure 
for performing some task.) 

One way to prove that something exists is to build it. Such proofs can be 
viewed as algorithms. To complete an algorithmic proof, we must prove that 
the algorithm terminates and yields the desired result. This may involve in- 
duction,contradiction, finiteness, etc. We prove that every graph has a large 
bipartitesubgraph by providing an algorithm to find one. Exercises 45—49 ar- 
erelated to finding large bipartite subgraphs. 


1.3.19. Theorem. Every loopless graph G has a bipartite subgraph with at 
least e(G)/2 edges. 


Proof: We start with any partition of V(G) into two sets X, Y. Using the edges 
having one endpoint in each set yields a bipartite subgraph H with bipartition 
X,Y. If H contains fewer than half the edges of G incident to a vertex v, then v 
has more edges to vertices in its own class than in the other class, as illustrated 
below. Moving v to the other class gains more edges of G than it loses. 

We move one vertex in this way as long as the current bipartite subgraph 
captures less than half of the edges at some vertex. Each such switch increases 
the size of the subgraph, so the precess must terminate. When it terminates, 
we have dy(v) > dg(v)/2 for every v e-V(G). Summing this and applying the 
degree-sum formula yields e(H) > e(G)/2. | 


al 


Algorithmic proofs often correspond to proofs by induction or extremality. 
Such proofs are shorter and may be easier to find, so we may seek such a proof 
and later convert it to an algorithm. For example, here is the proof of Theorem 
1.3.19 in the language of extremality and contradiction; in effect, the extremal 
choice of H goes directly to the end of the algorithm: 


Let H be the bipartite subgraph of G that has the most edges. If 
dy(v) > dg(v)/2 for all v € V(G), then the degree-sum formula yields 
e(H) > e(G)/2. Otherwise, dy(v) < dg(v)/2 for some v € V(G), and 
then switching v in the bipartition contradicts the choice of H. 


1.3.20. Example. Local maximum. The algorithm in Theorem 1.3.19 need not 
produce a bipartite subgraph with the most edges, merely one with at least half 
the edges. The graph below is 5-regular with 8 vertices and hence has 20 edges. 
The bipartition X = {a,b,c,d} and Y = {e, f, g,h} yields a 3-regular bipartite 
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subgraph with 12 edges. The algorithm terminates here; switching one vertex 
would pick up two edges but lose three. 

Nevertheless, the bipartition X = {a, b, g,h} and Y = {c,d,e, f} yields a 4- 
regular bipartite subgraph with 16 edges. An algorithm seeking the maximum 
by local changes may get stuck in a lecal maximum. a 


1.3.21. Remark. In a graph G, the (global) maximum number cf edges in a bi- 
partite subgraph is e(G) minus the minimum number of edges needed to obtain 
at least one edge from every odd cycle. | 


Our next extremal problera doesn’t start with bipartite graphs, but it winds 
up there. In politics and warfare, seldom do two enemies have a common en- 
emy; usually two of the three combine against the third. Given n factions, how 
many pairs of enemies can there be if no two enemies have a common enemy? 

In the language of graphs, we are asking for the maximum number of edges 
in a simple n-vertex graph with no triangle. Bipartite graphs iyave no triangles, 
but also many non-bipartite graphs (such as the Petersen graph) have no tri- 
angles. Using extremality (by choosing a vertex of maximum degree), we will 
prove that the maximum is indeed achieved by a complete bipartite graph. 


1.3.22. Definition. A graph G is H-free if G has no induced subgraph isomor- 
phic to H. 


1.3.23. Theorem. (Mantel [1907]) The maximum number of edges in an n- 
vertex triangle-free simple graph is | n”/4|. 

Proof: Let G be an n-vertex triangle-free simple graph. Let x be a vertex of 

maximum degree, with k = a(x). Since G has no triangles, there are no edges 

among neighbors of x. Hence summing the degrees of x and its nonneighbors 

counts at least one endpoint cf every edge: paar nx a(v) = e(G). We sum over 

n —k vertices, each having degree at most k, so e(G) < (n —k)k. 


= — 
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Since (n — k)k counts the edges in K,_;,,, we have now proved that e(G) is 
bounded by the size of some biclique with n vertices. Moving a vertex of K,,x. 
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from the set of size k to the set of size n — k gains k — 1 edges and loses n — k 
edges. The net gain is 2k — 1 — n, which is positive for 2k > n + 1 and negative 
for 2k <n+1. Thus e(K,_,,) is maximized when k is [n/2] or |[n/2]. The 
product is then n?/4 for even n and (n? — 1)/4 for odd n. Thus e(G) < |n?/4|. 
To prove that the bound is best possible, we exhibit a triangle-free graph 
with | n?/4 | edges: K n/2},[n/21> | 


Although (n — k)k can be maximized over k using calculus, the discrete 
approach is preferable in some ways. it directly restricts k to be an integer 
and generalizes easily to more variables. The switching idea used is that of 
Theorem 1.3.19; here we have used it. to find the largest bipartite subgraph of 
K,,. In Theorem 5.2.9 we generalize Theorem 1.3.23 to K,,1-free graphs. 

Mantel’s result leads us to another reason for phrasing inductive proofs in 
the format that we have used. The reason is safety. 


1.3.24. Example. A failed proof. Let us try to prove Theorem 1.3.23 by induc- 
tion on n. Basis step: n < 2. Here the complete graph K,, has the most edges 
and has no triangles. 

Induction step: n > 2. We try “Suppose that the claim is true when n = k, 
So K\x/2),1k/2] 18 the largest triangle-free graph with k vertices. We add a new 
vertex x to form a triangle-free graph with k + 1 vertices. Making x adjacent 
to vertices from both partite sets would create a triangle. Hence we add the 
most edges by making x adjacent to all the vertices in the larger partite set of 
Kx /2),1k/2). Doing so creates Kj 4+1)/2), [e+ 1) 2}. This completes the proof.” 

This argument is wrong, because we did not consider all triangle-free 
graphs with k + 1 vertices. We considered only those containing the extremal k- 
vertex graph as an induced subgraph. This graph does appear in the extremal 
graph with k + 1 vertices, but we cannot use that fact before proving it. It re- 
mains possible that the largest example with k + 1 vertices arises by adding a 
new vertex of high degree to a non-maximal example with k vertices. 

Exercise 51 develops a correct proof by induction on n. a 


The error in Example 1.3.24 was that our induction step did not consider 
all instances of the statement for the new larger value of the parameter. We 
call this error the induction trap. If the induction step grows an instance 
with the new value of the parameter from a smailer instance, then we must 
prove that all instances with the new value have been considered. 

When there is only one instance for each value of the induction parameter 
(as in summation formulas), this does not cause trouble. With more than one 
instance, itis safer and simpler to start with an arbitrary instance for the larger 
parameter value. This explicitly considers each instance G for the larger value, 
so we don’t need to prove that we have generated them all. 

However, when we obtain from G a smaller instance, we must confirm that 
the induction hypothesis applies to it. For example, in the inductive proof of 
the characterization of Eulerian circuits (Theorem 1.2.26), we must apply the 
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induction hypothesis to each component of the graph obtained by deleting the 
edges of a cycle, not to the entire graph at once. 


1.3.25. Remark. A template for induction. Often the statement we want to 
prove by induction on n is an implication: A(n) > B(n). We must prove that 
every instance G satisfying A(n) also satisfies B(n). Our induction step follows 
a typical format. From G we obtain some (smaller) G’. If we show that G’ 
satisfies A(n — 1) (for ordinary induction), then the induction hypothesis implies 
that G’ satisfies B(n — 1). Now we use the information that G’ satisfies B(n — 1) 
to prove that G satisfies B(n). 


G satisfies A(n) G satisfies B(n) 
4 tt 
G’ satisfies A(n-—1) => G’ satisfies B(n — 1) 


Here the central implication is the statement of the induction hypothesis, and 
the others are the work we must do. Our induction proofs have followed this 
format. a 


‘1.3.26.* Example. The induction trap. The induction trap can lead to a false 
conclusion. Let us try to prove by induction on the number of vertices that 
every 3-regular connected simple graph has no cut-edge. 

By the degree-sum formula, every regular graph with odd degree has even 
order, so we consider graphs with 2m vertices. The smallest 3-regular simple 
graph, K4, is connected and has no cut-edge; this proves the basis step with 
m = 2. Now consider the induction step. 

Given a simple 3-regular graph G with 2k vertices, we can obtain a simple 
3-regular graph G’ with 2(k + 1) vertices (the next larger possible order) by 
“expansion”: take two edges of G, replace them by paths of length 2 through 
new vertices, and add an edge joining the two new vertices. As illustrated 
below, K3.3 arises from K, by one expansion on two disjoint edges. 


—> 
expansion 
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If G is connected, then the expanded graph G’ is also connected: a path 
between old vertices that traversed a replaced edge has merely lengthened, and 
a path to a new vertex in G’ is obtained from a path in G to a neighbor. 

If G has no cut-edge, then every edge lies on a cycle (Theorem 1.2.14). 
These cycles remain in G’ (those using replaced edges become longer). The 
edge joining the two new vertices in G’ also lies on a cycle using a path in G 
between the edges that were replaced. Theorem 1.2:14 now implies that G’ has 
no cut-edge. 

We have proved that if G is connected and has no cut-edge, then the same 
holds for G’. We might think we have proved by induction on m ‘that every 3- 
regular simple connected graph with 2m vertices has no cut-edge, but the graph 
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below is a counterexample. The proof fails because we cannot build every 3- 
regular simple connected graph from K, by expansions. We cannot even obtain 
all those without cut-edges, as shown in Exercise 66. a 


pO-<X 


Appendix A presents another example of the induction trap. 


GRAPHIC SEQUENCES 


Next we consider all the vertex degrees together. 


1.3.27. Definition. The degree sequence of a graph is the list of vertex de- 
grees, usually written in nonincreasing order, as d; > --- > d,. 


Every graph has a degree sequence, but which sequences occur? That is, 
given nonnegative integers d,,..., d,, is there a graph with these as the vertex 
degrees? The degree-sum formula implies that }° d; must be even. When we 
allow loops and multiple edges, TONCAS. 


1.3.28. Proposition. The nonnegative integers d;,...,d, are the vertex de- 
grees of some graph if and only if }° d; is even. 


Proof: Necessity. When some graph G has these numbers as its vertex degrees, 
the degree-sum formula implies that }° d; = 2e(G), which is even. 

Sufficiency. Suppose that }° d; is even. We construct a graph with vertex 
set v1,...,U, and d(v;) = d; for alli. Since }°d; is even, the number of odd 
values is even. First form an arbitrary pairing of the vertices in {v;: d; is odd}. 
For each resulting pair, form an edge having these two vertices as its endpoints. 
The remaining degree needed at each vertex is even and nonnegative; satisfy 
this for each i by placing [d;/2| loops at v;. | 


This proof is constructive; we could also use induction (Exercise 56). The 
construction is easy with loops available. Without them, (2, 0, 0) is not realiz- 
able and the condition is not sufficient. Exercise 63 characterizes the degree 
sequences of loopless graphs. We next characterize degree sequences of simple 
graphs by a recursive condition that readily yields an algorithm. Many other 
characterizations are known; Sierksma—Hoogeveen [1991] lists seven. 


1.3.29. Definition. A graphic sequence is a list of nonnegative numbers that 
is the degree sequence of some simple graph. A simple graph with degree 
sequence d “realizes” d. 
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1.3.30. Example. A recursive condition. The lists 2,2,1,1 and 1,0,1 are 
graphit. The graph K2 + K, realizes 1,0,1. Adding a new vertex adjacent 
to vertices of degrees 1 and 0 yields a graph with degree sequence 2, 2, 1, 1, 
as shown below. Conversely, if a graph realizing 2, 2, 1, 1 has a vertex w with 
neighbors of degrees 2 and 1, then deleting w yields a graph with degrees 1, 0, 1. 


Leis 


Similarly, to test 33333221, we seek a realization with a vertex w of de- 
gree 3 having three neighbors of degree 3. This exists if and only if 2223221 is 
graphic. We reorder this and test 3222221. We continue deleting and reorder- 
ing until we can tell whether the remaining list is realizable. If it is, then we 
insert vertices with the desired neighbors to work back to a realization of the 
original list. The realization is not unique. 


The next theorem implies that this recursive test works. | 
33333221 Z 3222221 221111 Z 11110 
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1.3.31. Theorem. (Havel [1955], Hakimi [1962]) For n > 1, an integer list d 
of size n is graphic if and only if d’ is graphic, where d’ is obtained from d 
by deleting its largest element A and subtracting 1 from its A next largest 
elements. The only 1-element graphic sequence is d, = 0. 


Proof; For n = 1, the statement is trivial. For n > 1, we first prove that the 
condition is sufficient. Given d with d; > --- > d, and a simple graph G’ with 
degree sequence d’, we add a new vertex adjacent to vertices in G’ with degrees 
dg —1,...,d,41 — 1. These d;‘are the A largest elements of d after (one copy of) 
A itself, but dp —1,...,d,,1 — 1 need not be the A largest numbers in d’. 

To prove necessity, we begin with a simple graph G realizing d and produce 
a simple graph G’ realizing d’. Let w be a vertex of degree A in G. Let S bea 
set of A vertices in G having the “desired degrees” dg, .. + dasa. If N(w) = S, 
then we delete w to obtain G’. 

Otherwise, some vertex of S is missing from N(w). In this case, we modify 
G to increase |N(w) M S| without changing any vertex degree. Since |N(w)M S| 
can increase at most A times, repeating this converts G into another graph G* 
that realizes d and has S as the neighborhood of w. From G* we then delete w 
to obtain the desired graph G’ realizing d’. 
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To find the modification when N(w) 4 S, we choose x € S and z ¢ S so that 
w<zandw # x. We want to add wx and delete wz, but we musi preserve 
vertex degrees. Since d(x) > d(z) and already w is a neighbor of z but not x, 
there must be a vertex y adjacent to x but not to z. Now we delete {wz, xy} and 
add (wx, yz} to increase |V(w) NM S|. | 


Theorem 1.3.31 tests a list of n numbers by testing a list of n — 1 numbers; 
it yields a recursive algorithm to test whether d is graphic. The necessary 
condition “)°d; even” holds implicitly: )°d/ = (> d;) — 2A implies that )° d/ 
and )/d; have the same parity. 

An algorithmic proof using “local change” pushes an object toward a de- 
sired condition. This can be phrased as‘proof by induction, where the induction 
parameter is the “distance” from the desired condition. In the proof of Theorem 
1.3.31, this distance is the number of vertices in S that are missing from N(w). 

We used edge switches to transform an arbitrary graph with degree se- 
quence d into a graph satisfying the desired condition. Next we will show that 
every simple graph with degree pequente d can be transformed by such switches 
into every other. 


1.3.32. Definition. A 2-switch is the replacement of a pair of edges xy and 
zw in a simple graph by the edges yz and wx, given that yz and wx did not 
appear in the graph originally. 
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The dashed lines above indicate nonadjacent pairs. If y ~ z or w <— x, 
then the 2-switch cannot be performed, because the resulting graph would not 
be simple. A 2-switch preserves all vertex degrees. If some 2-switch turns H 
into H*, then a 2-switch on the same four vertices turns H* into H. Below we 
illustrate two successive 2-switches. 
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1.3.33.* Theorem. (Berge [1973, p153—154]) If G and H are two simple graphs 
with vertex set V, then dg(v) = dy(v) for every v € V if and only if there is 
a sequence of 2-switches that transforms G into H. 


Proof: Every 2-switch preserves vertex degrees, so the condition is sufficient. 
Conversely, when dg(v) = dy(v) for all v € V, we obtain an appropriate se- 
quence of 2-switches by induction on the number of vertices, n. If n < 3, then 
for each d;,..., d, there is at most one simple graph with d(v;) = d;. Hence we 
can use n = 3 as the basis step. 

Consider n > 4, and let w be a vertex of maximum degree, A. Let S = 
{v1,..., va} be a fixed set of vertices with the A highest degrees other than w. 
As in the proof of Theorem 1.3.31, some sequence of 2-switches transforms G 
to a graph G* such that Ng-«(w) = S, and some such sequence transforms H to 
a graph H* such that Ny-(w) = S. 
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Since Ng-(w) = Ny-(w), deleting:-w leaves simple graphs G’ = G* — w 
and H’ = H* — w with dg:(v) = dy-(v) for every vertex v. By the induction 
hypothesis, some sequence of 2-switches transforms G’ to H’. Since these do nct 
involve w, and w has the same neighbors in G* and H*, applying this sequence 
transforms G* to H*. Hence we can transform G to H by transforming G to G*, 
then G* to H*, then (in reverse order) the transformation of H to H*. :] 


We could also phrase this using induction on the number of edges appear- 
ing in exactly one of G and H, which is 0 if and only if they are already the 
same. In this approach, it suffices to find a 2-switch in G that makes it closer 
to H or a 2-switch in H that makes it closer to G. 


EXERCISES 


A statement with a parameter must be proved for all values of the parameter; it 
cannot be proved by giving examples. Counting a set includes providing proof. 


1.3.1. (—) Prove or disprove: If u and v are the only vertices of odd degree in a graph G, 
then G contains a u, v-path. 


1.3.2. (—) In a class with nine students, each student sends valentine cards to three 
others. Determine whether it is possible that each student receives cards from the same 
three students to whom he or she sent cards. 
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1.3.3. (—) Let u and v be adjacent vertices in a simple graph G. Prove that uv belongs 
to at least d(u) + d(v) — n(G) triangles in G. 


1.3.4. (—) Prove that the graph below is isomorphic to Q4. 


1.3.5. (—) Count the copies of P3; and C, in Q,. 


1.3.6. (—) Given graphs G and H, determine the number of components and maximum 
degree of G + H in terms of the those parameters for G and H. 


1.3.7. (—) Determine the maximum number of edges in a bipartite subgraph of P,, of 
C,,, and of K,,. 


1.3.8. (—) Which of the following are graphic sequences? Provide a construction or a 
proof of impossibility for each. 


a) (5,5,4,3,2,2,2,1), 
b) (5,5,4,4,2,2,1,1), 


c) (5,5,5,3,2,2,1,1), 
d) (5,5,5,4,2,1,1,1). 
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1.3.9. In a league with two divisions of 13 teams each, determine whether it is possi- 
ble to schedule a season with each team playing nine games against teams within its 
division and four games against teams in the other division. 


1.3.10. Let /, m,n be nonnegative integers with / +m =n. Find necessary and sufficient 
conditions on /,m,n such that there exists a connected simple n-vertex graph with / 
vertices of even degree and m vertices of odd degree. 


1.3.11. Let W be a closed walk in a graph G. Let H be the subgraph of G consisting of 
edges used an odd number of times in W. Prove that d;(v) is even for every v € V(G). 


1.3.12. (!) Prove that an even graph has no cut-edge. For each k > 1, construct a 
2k + 1-regular simple graph having a cut-edge. 


1.3.13. (+) A mountain range is a polygonal curve from (a, 0) to (b, 0) in the upper 
half-plane. Hikers A and B begin at (a, 0) and (b, 0), respectively. Prove that A and 
B can meet by traveling on the mountain range in such a way that at all times their 
heights above the horizontal axis are the same. (Hint: Define a graph to model the 
movements, and use Corollary 1.3.5.) (Communicated by D.G. Hoffman.) 
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1.3.14. Prove that every simple graph with at least two vertices has two vertices of 
equal degree. Is the conclusion true for loopless graphs? 


1.3.15. For each k > 3, determine the smallest n such that 
a) there is a simple k-regular graph with n vertices. 
b) there exist nonisomorphic simple k-regular graphs with n vertices. 


1.3.16. (+) For k > 2 and g > 2, prove that there exists an k-regular graph with girth 
g. (Hint: To construct such a graph inductively, make use of an k — 1-regular graph H 
with girth g and a graph with girth [2/2] that is n(H)-regular. Comment: Such a graph 
with minimum order is a (k, g)-cage.) (Erdés—Sachs [1963]) 


1.3.17. (!) Let G be a graph with at least two vertices. Prove or disprove: 
a) Deleting a vertex of degree A(G) cannot increase the average degree. 
b) Deleting a vertex of degree 5(G) cannot reduce the average degree. 


1.3.18. (!) For k > 2, prove that a k-regular bipartite graph has no cut-edge. 


1.3.19. Let G be a claw-free graph. Prove that if A(G) > 5, then G has a 4-cycle. For 
alln € N, construct a 4-regular claw-free graph of order at least n that has no 4-cycle. 


1.3.20. (!) Count the cycles of length n in K,, and the cycles of length 2n in Knn. 
1.3.21. Count the 6-cycles in Ky n- 


1.3.22. (!) Let G be a nonbipartite graph with n vertices and minimum degree k. Let / 
be the minimum length of an odd cycle in G. 

a) Let C be a cycle of length / in G. Proye that every vertex not in V(C) has at most 
two neighbors in V(C). 

b) By counting the edges joining V(C) ind G—V(C) in two ways, prove that n > kl/2 
(and thus / < 2n/k). (Campbell—Staton [1991]) 

c) When k is even, prove that the inequality of part (b) is best possible. (Hint: form 
a graph having k/2 pairwise disjoint /-cycles.) 


1.3.23. Use the recursive description of Q, (Example 1.3.8) to prove that e(Q,) = k2*-}. 
1.3.24. Prove that K23 is not contained in any hypercube Q,. 


1.3.25. (!) Prove that every cycle of length 27 in a hypercube is contained in a subcube 
of dimension at most r. Can a cycle of length 2r be contained in a subcube of dimension 
less than r? 


1.3.26. (!) Count the 6-cycles in Q3. Prove that every 6-cycle in Q, lies in exactly one 
3-dimensional subcube. Use this to count the 6-cycles in Q, for k > 3. 


1.3.27. Given k € N, let G be the subgraph of Q2,,, induced by the vertices in which the 
number of ones and zeros differs by-1. Prove that G is regular, and compute n(G), e(G), 
and the girth of G. 


1.3.28. Let V be the set of binary k-tuples. Define a simple graph Q, with vertex set V 
by putting u < v if and only ifu and v agree in exactly one coordinate. Prove that Q, is 
isomorphic to the hypercube Q, if and only if k is even. (D.G. Hoffman) 


1.3.29. («+) Automorphisms of the k-dimensional cube Q,. 

a) Prove that every copy of Q; in Q, is a subgraph induced by a set of 2/ vertices 
having specified values on a fixed set of k — j coordinates. (Hint: Prove that a copy of 
Q; must have two vertices differing in j coordinates.) 

b) Use part (a) to count the automorphisms of Q,. 
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1.3.30. Prove that every edge in the Petersen graph belongs to exactly four 5-cycles, 
and use this to show that the Petersen graph has exactly twelve 5-cycles. (Hint: For the 
first part, extend the edge to a copy of P, and apply Proposition 1.1.38.) 


1.3.31. (!) Use complete graphs and counting arguments (not algebra!) to prove that 
a) (5) = (§) + k(n —k&) + ("5") for 0 <k <n. 


b) If }>n; =n, then >> C) < C). 


1.3.32. (!) Prove that the number of simple even graphs with vertex set [n] is ("2"), 
(Hint: Establish a bijection to the set of all simple graphs with vertex set [n — 1].) 


1.3.33. (+) Let G be a triangle-free simple n-vertex graph such that every pair of non- 
adjacent vertices has exactly two common neighbors. 

a) Prove that n(G) = 1+ Co), where x € V(G). Conclude that G is regular. 

b) When k = 5, prove that deleting any one vertex and its neighbors from G leaves 
the Petersen graph. (Comment: When k = 5, the graph G is in fact the graph obtained 
from Q, by adding edges joining complementary vertices.) 


1.3.34. (+) Let G be a kite-free simple n-vertex graph such that every pair of nonadja- 
cent vertices has exactly two common neighbors. Prove that G is regular. (Galvin) 


1.3.35. (+) Let n and k be integers such that 1 < k <n —1. Let G bea simple n-vertex 
graph such that every k-vertex induced subgraph of G has m edges. 

a) Let G’ be an induced subgraph of & with / vertices, where / > k. Prove that 
eG’) = m()/(.)- 

b) Use part (a) to prove that G € (KucKr }. (Hint: Use part (a) to prove that the 
number of edges with endpoints u, v is independent of the choice of u and v.) 


1.3.36. Let G be a 4-vertex graph whose list of subgraphs obtained by deleting one 
vertex appears below. Determine G. 
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1.3.37. Let H be a graph formed by deleting a vertex from a loopless regular graph G 
with n(G) > 3. Describe (and justify) a method for obtaining G from H. 


1.3.38. Let G be a graph with at least 3 vertices. Prove that G is connected if and only if 
at least two of the subgraphs obtained by deleting one vertex ef G are connected. (Hint: 
Use Proposition 1.2.29.) 


1.3.39. («+) Prove that every disconnected graph G with at least three vertices is re- 
constructible. (Hint: Having used Exercise 1.3.38 to determine that G is disconnected, 
use G;,...,G, to find a component M of G that occurs the most times among the compo- 
nents with the maximum number of vertices, use Proposition 1.2.29 to choose v so that 
L = M — vis connected, and reconstruct G by finding some G — v; in which a copy of M@ 
became a copy of L.) 


1.3.40. (!) Let G be an n-vertex simple graph, where n > 2. Determine the maximum 
possible number of edges in G under each of the following conditions. 

a) G has an independent set of size a. 

b) G has exactly k components. 

c) G is disconnected. 
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1.3.41. (!) Prove or disprove: If G is an n-vertex simple graph with maximum degree 
fn /2] and minimum degree |n/2| — 1, then G is connected. 


1.3.42. Let S be a set of vertices in a k-regular graph G such that no two vertices in 
S are adjacent or have a common neighbor. Use the pigeonhole principle to prove that 
|S| < [n(G)/(k + 1)|. Show that the bound is best possible for the cube Q3. (Comment: 
The bound is not best possible for Q4.) 


1.3.43. (+) Let G be a simple graph with no isolated vertices, and let a = 2e(G)/n(G) be 
the average degree in G. Let t(v) denote the average of the degrees of the neighbors of v. 
Prove that t(v) > a for some v € V(G). Construct an infinite family of connected graphs 
such that t(v) > a for every vertex v. (Hint: For the first part, compute the average of 
t(v), using that x/v + y/x > 2 when x, y > 0.) (Ajtai-Komlés—Szemerédi [1980]) 


1.3.44. (!) Let G be a loopless graph with average vertex degree a = 2e(G)/n(G). 

a) Prove that G —.x has average degree at least a if and only if d(x) < a/2. 

b) Use part (a) to give an algorithmic proof that if a > 0, then G has a subgraph 
with minimum degree greater than «/2. 

c) Show that there is no constant c greater than 1/2 such that G must have a 
subgraph with minimum degree greater than ca; this proves that the bound in part (b) 
is best possible. (Hint: Use K;,,-1.) 


1.3.45. Determine the maximum number of edges in a bipartite subgraph of the Pe- 
tersen graph. 


k S 


1.3.46. Prove or disprove: Whenever the algorithm of Theorem 1.3.19 is applied to a 
bipartite graph, it finds the bipartite subgraph with the most edges (the full graph). 


0 


1.3.47. Use induction on n(G) to prove that every nontrivial loopless graph G has a 
bipartite subgraph H such that H has more than e(G)/2 edges. 


1.3.48. Construct graphs G,, G2,..., with 6, having 2n vertices, such that lim,... f, = 
1/2, where f,, is the fraction of E(G,,) belonging to the largest bipartite subgraph of G,,. 


1.3.49. For each k € N and each loopless graph G, prove that G has a k-partite subgraph 
H (Definition 1.1.12) such that e(H) > (1 — 1/k)e(G). 


1.3.50. (+) For n > 3, determine the minimum number of edges in a connected n-vertex 
graph in which every edge belongs to a triangle. (Erddés [1988]) 


1.3.51. (+) Let G be a simple n-vertex graph, where n > 3. 

a) Use Proposition 1.3.11 to prove that if G has more than n?/4 edges, then G has 
a vertex whose deletion leaves a graph with more than (n — 1)*/4 edges. (Hint: In every 
graph, the number of edges is an integer.) 

b) Use part (a) to prove by induction that G contains a triangle if e(G) > n?/4. 


1.3.52. Prove that every n-vertex triangle-free simple graph with the maximum number 
of edges is isomorphic to K,./2).[n/2;. (Hint: Strengthen the proof of Theorem 1.3.23.) 


1.3.53. (!) Each game of bridge involves two teams of two partners each. Consider 
a club in which four players cannot play a game if two of them have previously been 
partners that night. Suppose that 15 members arrive, but one decides to study graph 
theory. The other 14 people play until each has been a partner with four others. Next 
they succeed in playing six more games (12 partnerships), but after that they cannot 
find four players containing no pair of previous partners. Prove that if they can convince 
the graph theorist to play, then at least one more game can be played. (Adapted from 
Bondy—Murty [1976, p111]). 
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1.3.54. (+) Let G be a simple graph with x vertices. Let t(G) be the total number of 
triangles in G and G together. 

a) Prove that t(G) = (5) — (n — 2)e(G) + ae nies (“0”) triangles. (Hint: Consider 
the contribution made to each side by each triple of vertices. 

b) Prove that 1(G) > n(n — 1)(n — 5)/24. (Hint: Use a lower bound on } evi) (“9”) 
in terms of average degree.) 

c) When n — 1 is divisible by 4, construct a graph achieving equality in part (b). 
(Goodman [1959]) 


1.3.55. (+) Maximum size with no induced P4. 

a) Let G be the complement of a disconnected simple graph. Prove that e(G) < 
A(G)*, with equality only for Ka(c),a(G)- 

b) Let G be a simple connected P,-free graph with maximum degree k. Prove that 
e(G) < k*. (Seinsche [1974], Chung—West [1993]) 


) 
1.3.56. Use induction (on n or on 5 d;) to prove that if d),...,d, are nonnegative inte- 
gers and ) d; is even, then there is an n-vertex graph with vertex degrees dj,..., dy. 
(Comment: This requests an alternative proof of Proposition 1.3.28.) 


1.3.57. (!) Let n be a positive integer. Let d be a list of n nonnegative integers with even 
sum whose largest entry is less than n and differs from the smallest entry by at most 1. 
Prove that d is graphic. (Hint: Use the Havel-Hakimi Theorem. Example: 443333 is 
such a list, as is 33333322.) 
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1.3.58. Generalization of Havel-Hakimi Theorem. Given a nonincreasing list d of non- 
negative integers, let d’ be obtained by deleting d, and subtracting 1 from the k largest 
elements remaining in the list. Prove that d is graphic if and only if d’ is graphic. (Hint: 
Mimic the proof of Theorem 1.3.31.) (Wang=Kleitman [1973]) 


1.3.59. Define d = (d),..., dx) by do; = dois =i for 1 <i <k. Prove that d is graphic. 
(Hint: Do not use the Havel-Hakimi Theorem.) 


1.3.60. (+) Let d be a list of integers consisting of k copies of a and n — k copies of b, 
with a > b > 0. Determine necessary and sufficient conditions for d to be graphic. 


1.3.61. (!) Suppose that G = G and that n(G) = 1 mod 4. Prove that G has at least one 
vertex of degree (n(G) — 1)/2. 


1.3.62. Suppose that n is congruent to 0 or 1 modulo 4. Construct an n-vertex simple 
graph G with }(3) edges such that A(G) — 6(G) < 1. 


1.3.63. (!) Let di,...,d, be integers such that d; > .--- > d, > 0. Prove that there is 
a loopless graph (multiple edges allowed) with degree sequence d,...,d, if and only if 
>> d; is even and d; < d, +---+4d,. (Hakimi [1962]) 


1.3.64. (!) Let d, < --- < d, be the vertex degrees of a simple graph G. Prove that G is 
connected if dj => j when j <n —1-d,. (Hint: Consider a component that omits some 
vertex of maximum degree.) 


1.3.65. (+) Let a, <--- < a be distinct positive integers. Prove that there is a simple 
graph with a, + 1 vertices whose set of distinct vertex degrees is a),...,a,. (Hint: Use 
induction on k to construct such a graph.) (Kapoor—Polimeni—Wall [1977]) 


1.3.66. (*) Expansion of 3-regular graphs (see Example 1.3.26). For n = 4k, where 
k > 2, construct a connected 3-regular simple graph with n vertices that has no cut- 
edge but cannot be obtained from a smaller 3-regular simple graph by expansion. (Hint: 
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The desired graph must have no edge to which the inverse “erasure” operation can be 
applied to obtain a smaller simple graph.) 


1.3.67. (*) Construction of 3-regular simple graphs 

a) Prove that a 2-switch can be performed by performing a sequence of expansions 
and erasures; these operations are defined in Example 1.3.26. (Caution: Erasure is not 
allowed when it would produce multiple edges.) 

b) Use part (a) to prove that every 3-regular simple graph can be obtained from K4 
by a sequence of expansions and erasures. (Batagelj [1984]) 


1.3.68. (*) Let G and H be two simple bipartite graphs, each with bipartition X,Y. 
Prove that dg(v) = dy(v) for all v € X UY if and only if there is a sequence of 2-switches 
that transforms G into H without ever changing the bipartition (each 2-switch replaces 
two edges joining X and Y by two other edges joining X and Y). 


1.4. Directed Graphs 


We have used graphs to model symmetric relations. Relation need not be 
symmetric; in general, a relation on S can be any set of ordered pairs in 5 x S 
(see Appendix A). For such relations, we need a more general model. 


DEFINITIONS AND EXAMPLES 
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Seeking a graphical representation of the information in a general relation 
on S leads us to a model of directed graphs. 


1.4.1. Example. For natural numbers x, y, we say that x is a “maximal divisor” 
of y if y/x is a prime number. For S CN, the set R = {(x, y) € S?: x isa maximal 
divisor of y} is a relation on S. To represent it graphically, we name a point 
in the plane for each element of S and draw an arrow from x to y whenever 


(x, y) € R. Below we show the result when S = [12]. | 
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1.4.2. Definition. A directed graph or digraph G is a triple consisting of a 
vertex set V(G), an edge set E(G), and a function assigning each edge 
an ordered pair of vertices. The first vertex of the ordered pair is the tail 
of the edge, and the second is the head; together, they are the endpoints. 
We say that an edge is an edge from its tail to its head. 
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The terms “head” and “tail” come from the arrows used to draw digraphs. 
As with graphs, we assign each vertex a point in the plane and each edge a curve 
joining its endpoints. When drawing a digraph, we give the curve a direction 
from the tail to the head. 

When a digraph models a relation, each ordered pair is the (head, tail) 
pair for at most one edge. In this setting as with simple graphs, we ignore the 
technicality of a function assigning endpoints to edges and simply treat an edge 
as an ordered pair of vertices. 


1.4.3. Definition. In a digraph, a loop is an edge whose endpoints are equal. 
Multiple edges are edges having the same ordered pair of endpoints. A 
digraph is simple if each ordered pair is the head and tail of at most one 
edge; one loop may be present at each vertex. 

In a simple digraph, we write uv for an edge with tail u and head v. 
If there is an edge from u to v, then v is a successor of u, and u is a 
predecessor of v. We write u — v for “there is an edge from u to v”. 


1.4.4. Application. A finite state machine (also called finite automaton 
or discrete system) has a number of possible “states”. Such a system can 
be modeled using a digraph in which vertices represent the states and edges 
represent the possible transitions between states. 

Transitions inherently move in one direction, so digraphs provide the ap- 
propriate model. Labels on the edges can be used to record the events that cause 
the transitions. When an event causes-the system to remain in the same state, 
we have a loop. When two types of events can cause a particular transition, we 
might use multiple edges. 

Consider a light controlled by two switches, often called a “three-way 
switch”. The first switch can be up or down, the second switch can be up or 
down, and the light can be on (+) or off (—). Thus there are eight states. Tran- 
sitions between states result by flipping switches. In the drawing below, the 
horizontal edges represent transitions caused by flipping the first switch, and 
the vertical edges represent transitions caused by flipping the second switch. 
(Drawing vertices large enough to put labels inside is not uncommon when 
discussing finite state machines, but we will stick with filled dots.) a 


1.4.5.* Application. Edge labels can be used to record transition probabili- 
ties when a system operates randomly. The probabilities on the edges leaving a 
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vertex sum to 1, and the system is called a Markov chain. Methods of linear al- 
gebra can be used to compute the long-term fraction of time spent in each state. 

For example, suppose that weather has two states: good and bad. Air 
masses move slowly enough that tomorrow’s weather tends to be like today’s. 
In most places, storm systems don’t linger long, so we might have transition 
probabilities as shown below. If we record states hourly instead of daily, then 
the probably of remaining in the same state is much higher. a 
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1.4.6. Definition. A digraph is a path if it is a simple digraph whose vertices 
can be linearly ordered so that there is an edge with tail u and head v if and 
only if v immediately follows u in the vertex ordering. A cycle is defined 
similarly using an ordering of the vertices on a circle. 


1.4.7. Example. Functional digraphs. We can study a function f: A > A us- 
ing digraphs. The functional digraph of f is the simple digraph with vertex 
set A and edge set {(x, f(x): x € A}. For each x, the single edge with tail x 
points to the image of x under f. 

Following a path in a functional digraph corresponds to iterating the func- 
tion. In a permutation, each element isthe image of exactly one element, so the 
functional digraph has one head and one tail at each vertex. Hence the func- 
tional digraph of a permutation consists of disjoint cycles. Below we show the 
functional digraph for a permutation of [7]. a 


1.4.8.* Remark. We often use the same names for corresponding concepts in 
the graph and digraph models. Many authors replace “vertex” and “edge” with 
“node” and “arc” to discuss digraphs, but this obscures the analogies. Some 
results have the same statements and proofs; it would be wasteful to repeat 
them just to change terminology (especially in Chapter 4). 

Also, a graph G can be modeled using a digraph D in which each edge 
uv € E(G) is replaced with uv, vu € E(D). In this way, results about digraphs 
can be applied to graphs. Since the notion of “edge” in digraphs extends the 
notion of “edge” in graphs, using the same name makes sense. 

Some authors write “directed path” and “directed cycle” for our concepts of 
path and cycle in digraphs, but the distinction is unnecessary; for the “weak” 
version that does not follow the arrows, we can speak of a path or cycle in the 
graph obtained by ignoring the directions, which we define next. | 
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1.4.9. Definition. The underlying graph of a digraph D is the graph G ob- 
tained by treating the edges of D as unordered pairs; the vertex set and 
edge set remain the same, and the endpoints of an edge are the same in G 
as in D, but in G they become an unordered pair. 


<> <> 


D G 


Most ideas and methods of graph theory arise in the study of ordinary 
graphs. Digraphs can be a useful additional tool, especially in applications, as 
we have tried to suggest. We hope that describing the analogies and contrasts 
between graphs and digraphs will help clarify the concepts. 

When comparing a digraph with a graph, we usually use G for the graph 
and D for the digraph. When discussing a single digraph, we often use G. 


1.4.10. Definition. The definitions of subgraph, isomorphism, decomposi- 
tion, and union are the same for graphs and digraphs. In the adjacency 
matrix A(G) of a digraph G, the-entry in position i, j is the number of 
edges from y; to v;. In the incidence matrix M(G) of a loopless digraph 


G, we set m;,; = +1 if v; is the tail of e; and m;,; = —1 if v; is the head of e;. 
1.4.11. Example. The underlying graph of the digraph below is the graph of 
Example 1.1.19; note the similarities and differences in their matrices. | 

wx y Zz a b c od e 

wf001 0 —-1 +1 0 0 0O 

x}/1010 +1 0 +1 -1 O 

y!'!O101 0 -1 -1 +1 +1 

z\0 0 0 0 0 0 0 0 -1 


A(G) M(G) 


To define connected digraphs, two options come to mind. We could require 
only that the underlying graph be connected. However, this does not capture 
the most useful sense of connection for digraphs. 


1.4.12. Definition. A digraph is weakly connected if its underlying graph is 
connected. A digraph is strongly connected or strong if for each ordered 
pair u, v of vertices, there is a path from u to v. The strong components 
of a digraph are its maximal strong subgraphs. 


1.4.13. Example. The 2-vertex digraph consisting only of the edge xy has an 
x, y-path but no y, x-path and is not strongly connected. As a digraph, an n- 
vertex path has n strong components, but a cycle has only one. In the digraph 
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below, the three circled subdigraphs are the strong components. Properties of 
strong components are discussed in Exercises 10-13. a 


1.4.14.* Application. Games. Many games with two players can be described 
as finite state machines. The vertex set is the set of possible states of the game. 
There is an edge from state x to state y if some move can be made (by the player 
whose turn it is to play) to reach state y from state x. 

Let W be the set of vertices for winning positions; a player who brings the 
game to such a state wins. No edges leave W. A player who brings the game 
to a state with an edge to W loses, since the other player then reaches W. One 
way to analyze the game is to seek a set S of pairwise nonadjacent vertices 
containing W such that every vertex outside S has an edge to a vertex in S. A 
player who can bring the game to a pom non in S wins, but one who must move 
from a position in S loses. 

For example, consider a game witntwo piles of pennies. At his or her turn, 
each player can remove any portion of a single pile. The player who removes 
the last coin wins. The possible game positions are the nonnegative integer 
pairs (r,s). The definition of the game specifies (0, 0) as the only winning posi- 
tion. However, the set S of desirable positions is {(r,r): r > 0}. Since only one 
coordinate can decrease on a move, there is no edge within S. For each vertex 
(r,s) ¢ S, a player can remove |r — s| from the larger pile to reach S. 

The general game of Nim starts with an arbitrary number of piles with ar- 
bitrary sizes, but otherwise the rules of the game are the same as this. Exercise 
18 guarantees that Nim always has a winning strategy set S, since the digraph 
for this game has no cycles. If the initial position is in S, then the second player 
wins (assuming optimal play). Otherwise, the first player wins. | 


1.4.15.* Definition. A kernel in the digraph D is a set S C V(D) such that S$ 
induces no edges and every vertex outside S has a successor in S. 


A digraph that is an odd cycle has no kernel (Exercise 17), but forbidding 
odd cycles as subdigraphs always yields a kernel. In proving this, all uses of 
paths, cycles, and walks are in the directed sense. We need several statements 
about movement in digraphs that hold by the same proofs as in graphs. For 
example, every u, v-walk in a digraph contains a u, v-path (Exercise 3), and 
every closed odd walk in a digraph contains an odd cycle (Exercise 4). The 
concept of distance from x to y will be explored more fully in Section 2.1; it is 
the least length of an x, y-path. 


58 Chapter 1: Fundamental Concepts 


1.4.16.* Theorem. (Richardson [1953]) Every digraph having no odd cycle has 
a kernel. 


Proof; Let D be such a digraph. We first consider the case that D is strongly 
connected; see the figure on the left below. Given an arbitrary vertex y € V(D), 
let S be the set of vertices with even distance to y. Every vertex with odd 
distance to y has a successor in S, as desired. 

If the vertices of S are not pairwise nonadjacent, then there is an edge uv 
with u,v € S. By the definition of S, there is a u, y-path P of even length and a 
v, y-path P’ of even length. Adding uv at the start of P’ yields a u, y-walk W of 
odd length. Because D is strong, D has a y, u-path Q. Combining Q with one 
of P or W yields a closed odd walk in D. This is impossible, since a closed odd 
walk contains an odd cycle. Thus S is a kernel in D. 


: 


For the general case, we use induction on n(D). 

Basis step: n(D) = 1. The only example i is a single vertex with no loop. 
This vertex is a kernel by itself. 

Induction step: n(D) > 1. Since:we have already proved the claim for 
strong digraphs, we may assume that D is not strong. For some strong compo- 
nent D’ of D, there is no edge from a vertex of D’ to a vertex not in D’ (Exercise 
11). We have shown that D’ has a kernel; let S’ be a kernel of D’. 

Let D” be the subdigraph obtained from D by deleting D’ and all the pre- 
decessors of S’. By the induction hypothesis, D” has a kernel; let S” be a kernel 
of D”. We claim that S’ U S” is a kernel of D. Since D” has no predecessor of S$’, 
there is no edge within S’ US”. Every vertex in D” — S” has a successor in S”, 
and all other vertices not in S’U S” have a successor in S’. | 


th_ 


VERTEX DEGREES 


In a digraph, we use the same notation for number of vertices and num- 
ber of edges as in graphs. The notation for vertex degrees incorporates the 
distinction between heads and tails of edges. 


1.4.17. Definition. Let v be a vertex in a digraph. The outdegree dt(v) is 
the number of edges with tail v. The indegree d-(v) is the number of 
edges with head v. The out-neighborhood or successor set N*(v) is 
{x € V(G): v > x}. The in-neighborhood or predecessor set N~(v) is 
{x € V(G): x — v}. The minimum and maximum indegree are 5~(G) and 
A~(G); for outdegree we use 5*(G) and At(G). 


The digraph analogue of the degree-sum formula for graphs is easy. 
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1.4.18. Proposition. In a digraph G, >) .-y(g) d*(v) = e(G) = DVoveyigyd (U). 
Proof: Every edge has exactly one tail and exactly one head. | 


The digraph analogue of degree sequence is the list of “degree pairs” 
(dt(v;),d~(v;)). When is a list of pairs realizable as the degree pairs of a 
digraph? As with graphs, this is easy when we allow multiple edges. 


1.4.19.* Proposition. A list of pairs of nonnegative integers is realizable as 
the degree pairs of a digraph if and only if the sum of the first coordinates 
equals the sum of the second coordinates. 


Proof: The condition is necessary because every edge has one tail and one head, 
contributing once to each sum. 

For sufficiency, consider the pairs {(d* ,d;): 1 < i < n} and vertices 
vj,...,U,- Letm = ea ee 24; Consider m dots. Give the dots positive 
labels, with d;* of them having label i. Also give the dots negative labels, with 
d; of them having label —j. For each dot with labels i and —j, place an edge 
from v; to v;. This creates a digraph with d*(v;) = d;* and d~(v;) =d;. | 


The analogous question for simple digraphs is harder. The question can 
be rephrased in terms of bipartite graphs via a transformation that is useful in 
many problems about digraphs. 


math_b 


1.4.20.* Definition. The split of a digraph D is a bipartite graph G whose 
partite sets V*, V~ are copies of V(D). For each vertex x € V(D), there is 
one vertex x*+ € V* and one vertex x” € V~. For each edge from u to v in 
D, there is an edge with endpoints ut, v~ in G. 


w ut ye wt x 

v ue v- Ww x7 
1.4.21.* Remark. The degrees of the vertices in the split of D are the indegrees 
and outdegrees of the vertices in D. 

Furthermore, an X, Y-bigraph G with |X| = |Y| = n can be transformed 
into an n-vertex digraph D by putting an edge v;v; in D for each edge x;y; in G; 
now G is the split of D. (This is one reason to allow loops in simple digraphs.) 

Thus there is a simple digraph with degree pairs {(d;*,d;): 1 <i < n} if 
and only if there is a simple bipartite graph G in which the vertex degrees are 
dy,...,d* in one partite set and d;,...,d, in the other partite set. Exercise 
32 obtains a recursive test for existence of such a bipartite graph. The state- 


ment and proof are like that of the Havel—Hakimi Theorem, so we leave further 
discussion to the exercise. s 


rs 
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EULERIAN DIGRAPHS 


The definitions of trail, walk, circuit, and the connection relation are 
the same in graphs and digraphs when we list edges as ordered pairs of ver- 
tices. In a digraph, the successive edges must “follow the arrows”. In a walk 
V0, €1,---, x, Uz, the edge e; has tail v;_, and head 1;. 


1.4.22. Definition. An Eulerian trail in a digraph (or graph) is a trail con- 
taining all edges. An Eulerian circuit is a closed trail containing all 
edges. A digraph is Eulerian if it has an Eulerian circuit. 


The characterization of Eulerian digraphs is analogous to the characteri- 
zation of Eulerian graphs. The proof is essentially the same as for graphs, so 
we leave it to the exercises. 


1.4.23. Lemma. If G is a digraph with 5+(G) > 1, then G contains a cycle. The 
same conclusion holds when 6° (G) > 1. 


Proof: Let P be a maximal path in G, and let u be the last vertex of P. Since P 
cannot be extended, every successor of u must already be a vertex of P. Since 
5*(G) = 1, u has a successor v on P. ‘The edge uv completes a cycle with the 
portion of P from v to u. = | 


1.4.24. Theorem. A digraph is Eulerian if and only if d+(v) = d~(v) for each 
vertex v and the underlying graph has at most one nontrivial component. 


Proof: See Exercise 19 or Exercise 20. wi 


Every Eulerian digraph with no isolated vertices is strongly connected, 
although the characterization states that being weakly connected is sufficient. 


1.4.25. Application. de Bruijn cycles. There are 2” binary strings of length n. 
Is there a cyclic arrangement of 2” binary digits such that the 2” strings of n 
consecutive digits are all distinct? For n = 4, (0000111101100101) works. 

We can use such an arrangement to keep track of the position of a rotating 
drum (Good [1946]). Our drum has 2” rotational positions. A band around the 
circumference is split into 2” portions that can be coded 0 or 1. Sensors read n 
consecutive portions. If the coding has the property specified above, then the 
position of the drum is determined by the string read by the sensors. 

To obtain such a circular arrangement, define a digraph D, whose vertices 
are the binary (n — 1)-tuples. Put an edge from a to b if the last n — 2 entries 
of .a agree with the first n — 2 entries of b. Label the edge with the last entry 
of b. Below we show D4. We next prove that D,, is Eulerian and show how an 
Eulerian circuit yields the desired circular arrangement. | 


Section 1.4: Directed Graphs 61 


1.4.26. Theorem. The digraph D, of Application 1.4.25 is Eulerian, and the 
edge labels on the edges in any Eulerian circuit of D, form a cyclic arrange- 
ment in which the 2” consecutive segments of length n are distinct. 


Proof: We show first that D,, is Eulerian. Every vertex has outdegree 2, because 
we can append a 0 or a 1 to its name to obtain the name of a successor ver- 
tex. Similarly, every vertex has indegree 2, because the same argument applies 
when moving in reverse and putting a 0 or a 1 on the front of the name. Also, 


D,, is strongly connected, because we can reach the vertex b = (by,..., Dy_1) 
from any vertex by successively following the edges labeled b;,...,b,-1. Thus 
D,, satisfies the hypotheses of Theorem 1.4.24 and is Eulerian. 

Let C be an Eulerian circuit of D,. Arrival at vertex a = (a1,...,dn-1) 


must be along an edge with label a,,_;,-because the label on an edge entering a 
vertex agrees with the last entry of the name of the vertex. Since we delete the 
front and shift the rest to obtain the rest of the name at the head, the successive 
earlier labels (looking backward) must have been a,_2, ..., a; in order. If C next 
uses an edge with label a,, then the list consisting of the n most recent edge 
labels at that time is aj,..., a. 

Since the 2”—! vertex labels are distinct, and the two edges leaving each 
vertex have distinct labels, and we traverse each edge from each vertex exactly 
once along C, we have shown that the 2” strings of length n in the circular 
arrangement given by the edge labels along C are distinct. | 


The digraph D, is the de Bruijn graph of order n on an alphabet of size 2. 
It is useful for other purposes, because it has many vertices and few edges (only 
twice the number of vertices) and yet we can reach each vertex from any other 
by a short path. We can reach any desired vertex in n — 1 steps by introducing 
the bits in its name in order from the current vertex. 


ORIENTATIONS AND TOURNAMENTS 


There are n? ordered pairs of elements that can be formed from a vertex set 
of size n. A simple digraph allows loops but uses each ordered pair at most once 
as an edge. Thus there are n? ordered pairs that may or may not be present as 
edges, and there are 2” simple digraphs with vertex set v},..., Un. 

Sometimes we want to forbid loops. 
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1.4.27. Definition. An orientation of a graph G is a digraph D obtained from 
G by choosing an orientation (x — y or y — x) for each edge xy € E(G). 
An oriented graph is an orientation of a simple graph. A tournament 
is an orientation of a complete graph. 


An oriented graph is the same thing as a loopless simple digraph. When 
the edges of a graph represent comparisons to be performed among items cor- 
responding to the vertices, we can record the results by putting x — y when x 
does better than y in the comparison. The outcome is an orientation of G. 

The number of oriented graphs with vertices 1, ..., Uv, is 3G); the number 
of tournaments is 2). 


1.4.28. Example. Orientations of complete graphs model “round-robin tourna- 
ments”. Consider an n-team league where each team plays every other exactly 
once. For each pair u,v, we include the edge uv if u wins or vu if v wins. At 
the end of the season we have an orientation of K,. The “score” of a team is its 
outdegree, which equals its number of wins. 

We therefore call the outdegree sequence of a tournament its score se- 
quence. The outdegrees determine the indegrees, since d*(v) +d~(v) =n—1 
for every vertex v. It is easier to characterize the score sequences of tourna- 
ments than the degree sequences of simple graphs (Exercise 35). | 

A tournament may have more than one vertex with maximum outdegree, 
so there may be no clear “winner”—in the example below, every vertex has 
outdegree 2 and indegree 2. Choosing,a champion when several teams have 
the maximum number of wins can be difficult. Although there need not be a 
clear winner, we show next that there must always be a team x such that, for 
every other team z, either x beats z or x beats some team that beats z. _ 


1.4.29. Definition. In a digraph, a king is a vertex from which every vertex is 
reachable by a path of length at most 2. 


1.4.30. Proposition. (Landau [1953]) Every tournament has a king. 


Proof: Let x be a vertex in a tournament T. If x is not a king, then some vertex 
y is not reachable from x by a path of length at most 2. Hence no successor of 
x is a predecessor of y. Since T is an orientation of a clique, every successor of 
x must therefore be a successor of y. Also y > x. Hence d*(y) > d*(x). 
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If y is not a king, then we repeat the argument to find z with yet larger 
outdegree. Since T is finite, we cannot forever obtain vertices of successively 
higher outdegree. The procedure must terminate, and it can terminate only 
when we have found a king. | 


N7~(x) N*(x) 


In the language of extremality, we have proved that every vertex of maxi- 
mum outdegree in a tournament is a king. Exercises 36-38 ask further ques- 
tions about kings (see also Maurer [1980]). Exercise 39 generalizes the result 
to arbitrary digraphs. 


EXERCISES 


1.4.1. (—) Describe a relation in the real world whose digraph has no cycles. Describe 
another that has cycles but is not symmetric. 


1.4.2. (—) In the lightswitch system of Application 1.4.4, suppose the first switch be- 
comes disconnected from the wiring. Draw the digraph that models the resulting system. 


1.4.3. (—) Prove that every u, v-walk ina digraph contains a u, v-path. 


1.4.4, (—) Prove that every closed walk of odd length in a digraph contains the edges of 
an odd cycle. (Hint: Follow Lemma 1.2.15.) 


1.4.5. (—) Let G be a digraph in which indegree equals outdegree at each vertex. Prove 
that G decomposes into cycles. 


1.4.6. (—) Draw the de Bruijn graphs Dz and D4. 


1.4.7. (—) Prove or disprove: If D is an orientation of.a simple graph with 10 vertices, 
then the vertices of D cannot have distinct outdegrees. 


1.4.8. (—) Prove that there is an n-vertex tournament with indegree equal to outdegree 
at every vertex if and only ifn is odd. 


e e e e e 
1.4.9. For each n > 1, prove or disprove: Every simple digraph with n vertices has two 
vertices with the same outdegree or two vertices with the same indegree. 


1.4.10. (!) Prove that a digraph is strongly connected if and only if for each partition of 
the vertex set into nonempty sets S and 7, there is an edge from S to T. 


1.4.11. (!) Prove that in every digraph, some strong component has no entering edges, 
and some strong component has no exiting edges. 


1.4.12. Prove that in a digraph the connection relation is an equivalence relation, and 
its equivalence classes are the vertex sets of the strong components. 
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1.4.13. a) Prove that the strong components of a digraph are pairwise disjoint. 

b) Let D;,..., Dy be the strong components of a digraph D. Let D* be the loopless 
digraph with vertices v1, ..., , such that v; > v; if and only ifi # j and D has an edge 
from D; to D;. Prove that D* has no cycle. 


v 
- 


1.4.14. (!) Let G be an n-vertex digraph with no cycles. Prove that the vertices of G can 
be ordered as 1;,..., v, so that if vjv; ¢ E(G), theni < j. 


1.4.15. Let G be the simple digraph with vertex set {(i, j) ¢ Z7:0 <i <mand0O <n} 
and an edge from (i, j) to (i’, j’) if and only if (i’, j) is obtained from (i, j) by adding 1 
to one coordinate. Prove that the number of paths from (0, 0) to (m,n) in G is ("*"). 


n 


1.4.16. (+) Fermat’s Little Theorem. Let Z, denote the set of congruence classes of 
integers modulo n (see Appendix A). Let a be a natural number having no common 
prime factors with n; multiplication by a defines a permutation of Z,. Let / be the least 
natural number such that a’ =a mod n. 
a) Let G be the functional digraph with vertex set Z,, for the permutation defined 
by multiplication by a. Prove that all cyclesiin G (except the loop on n) have length /— 1. 
b) Conclude from part (a) that a"-* = 4 mod n. 


1.4.17. (*) Prove that a (directed) odd cycle is a digraph with no kernel. Construct a 
digraph that has an odd cycle as an induced subgraph but does have a kernel. 


1.4.18, (x) Prove that a digraph having no cycle has a unique kernel. 


1.4.19. Use Lemma 1.4.23 and induction on the number of edges to prove the charac- 
terization of Eulerian digraphs (Theorem 1.4.24). (Hint: Follow Theorem 1.2.26.) 


1.4.20. Prove the characterization of Eulerian digraphs (Theorem 1.4.24) using the 
notion of maximal trails. (Hint: Follow 1.2.32, the second proof of Theorem 1.2.26.) 


1.4.21, Theorem 1.4.24 establishes necessary and sufficient conditions for a digraph to 
have an Eulerian circuit. Determine (with proof), the necessary and sufficient conditions 
for a digraph to have an Eulerian trail (Definition 1.4.22). (Good [1946]) 


1.4.22, Let D be a digraph with d~(v) = d*(v) for every vertex v, except that dt(x) — 
d~ (x) = k =d7(y) — d*(y). Use the characterization of Eulerian digraphs to prove that 
D contains k pairwise edge-disjoint x, y-paths. 


1.4.23. Prove that every graph G has an orientation D that is “balanced” at each vertex, 
meaning that |d3(v) — dp(v)| < 1 for every v € V(G). 


1.4.24, Prove or disprove: Every graph G has an orientation such that for every S$ C 
V(G), the number of edges entering S and leaving S differ by at most 1. 


1.4.25. (!) Orientations and P3-decomposition. 

a) Prove that every connected graph has an orientation in which the number of 
vertices with odd outdegree is at most 1. (Rotman [1991]) 

b) Use part (a) to conclude that a simple connected graph with an even number of 
edges can be decomposed into paths with two edges. 
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1.4.26. Arrange seven 0’s and seven 1’s cyclically so that the 14 strings of four consecu- 
tive bits are all the 4-digit binary strings other than 0101 and 1010. 


1.4.27. DeBruijn sequence for any alphabet and length. Let A be an alphabet of size k. 
Prove that there exists a cyclic arrangement of k' characters chosen from A such that 
the k' strings of length / in the sequence are all distinct. (Good [1946], Rees [1946]) 


1.4.28. Let S be an alphabet of size m. Explain how to produce a cyclic arrangement of 
m* — m letters from S such that all four-letter strings of consecutive letters are different 
and contain at least two distinct letters. 


1.4.29. (!) Suppose that G is a graph and D is an orientation of G that is strongly 
connected. Prove that if G has an odd cycle, then D has an odd cycle. (Hint: Consider 
each pair {v;, v;+1} in an odd cycle (v1, ..., vz) of G.) 


1.4.30. (+) Given a strong digraph D, let f(D) be the length of the shortest closed walk 
visiting every vertex. Prove that the maximum value of f(D) over all strong digraphs 
with n vertices is | (n + 1)?/4| ifn > 2. (Cull [1980]) 


1.4.31. Determine the minimum n such that there is a pair of nonisomorphic n-vertex 
tournaments with the same list of outdegrees. 


1.4.32. Let p = pi,..., Pm and q = qi,...,qn be lists of nonnegative integers. The 
pair (p, g) is bigraphic if there is a simple bipartite graph in which pj,..., p, are the 
degrees for one partite set and qi,...,q, are the degrees for the other. When p has 
positive sum, prove that (p,q) is bigraphic if and only if (p’, q’) is bigraphic, where 
(p', q’) is obtained from (p, q) by deleting the largest element A from p and subtracting 
1 from each of the A largest elements of g. (Hint: Follow the method of Theorem 1.3.31.) 


1.4.33. (+) Let A and B be two m by n matrices with entries in {0,1}. An exchange 
operation substitutes a submatrix of the form (io) for a submatrix of the form (31) or vice 
versa. Prove that if A and B have the seme list of row sums and have the same list of 
column sums, then A can be transformed into B by a sequence of exchange operations. 


Interpret this conclusion in the context of bipartite graphs. (Ryser [1957]) 


1.4.34. (!) Let G and H be two tournaments on a vertex set V. Prove that dé(v) = dy (v) 
for all v € V if and only if G can be turned into H by a sequence of direction-reversals 
on cycles of length 3. (Hint: Consider a vertex of maximum outdegree in the subgraph 
of G consisting of edges oriented oppositely in H.) (Ryser [1964]) 


1.4.35. (+) Let pi,..., p, be nonnegative integers with p <--: < pn». Let p, = a Di. 
Prove that there exists a tournament with outdegrees p1,..., p, if and only if p,, > (5) 
for 1 < k <n and p’, = (5). (Hint: Use induction on )77_,[p; — (3)].) (Landau [1953)) 


1.4.36. By Proposition 1.4.30, every tournament has a king. Let T be a tournament 
having no vertex with indegree 0. 

a) Prove that if x is a king in 7, then T has another king in N~ (x). 

b) Use part (a) to prove that T has at least three kings. 

c) For each n > 3, construct a tournament T with 5-(T) > 0 and only 3 kings. 

(Comment: There exists an n-vertex tournament having exactly k kings whenever 
n> k > 1 except when k = 2 and when n = k = 4.) (Maurer [1980]) 


1.4.37. Consider the following algorithm whose input is a tournament T. 
1) Select a vertex x in T. 
2) If x has indegree 0, call x a king of T and stop. 
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3) Otherwise, delete {x} U N*t (x) from T to form T’. 
4) Run the algorithm on 7’; call the output a king in T and stop. 
Prove that this algorithm terminates and produces a king in T 


1.4.38. (+) For n € N, prove that there is an n-vertex tournament in which every vertex 
is a king if and only ifn ¢ {2, 4}. 


1.4.39. (+) Prove that every loopless digraph D has a set S of pairwise nonadjacent 
vertices such that every vertex outside S is reached from S by a path of length at most 
2. (Hint: Use strong induction on n(D), Comment: This generalizes Proposition 1.4.30.) 
(Chvatal—Lovasz [1974]) 


1.4.40. A directed graph is unipathic if for every pair of vertices x, y there is at most 
one (directed) x, y-path. Let 7, be the tournament on n vertices with the edge between 
v; and v; directed toward the vertex with larger index. What is the maximum number of 
edges in a unipathic subgraph of 7,,? How many unipathic subgraphs are there with the 
maximum number of edges? (Hint: Show that the underlying graph has no triangles.) 
(Maurer—Rabinovitch—Trotter [1980]) 


1.4.41. Let G be a tournament. Let Lo be a listing of V(G) in some order. If y immedi- 
ately follows x in Lo but y > x in G, then yx is a reverse edge. We can interchange x 
and y in the order when yx is a reverse edge (this may increase the number of reverse 
edges). Suppose that a sequence Lo, L;,--- is produced by successively switching one 
reverse edge in the current order. Prove that this always leads to a list with no reverse 
edges. Determine the maximum number of steps to termination. (Comment: In the spe- 
cial case where the vertices are numbers and each edge points to the higher number of 
the pair, the result says that successively switching adjacent numbers that are out of 
order always eventually sorts the list.) (Loeke [1995]) 


1.4.42. (!) Given an ordering o = 1,..., v, of the vertices of a tournament, let f(a) be 
the sum of the lengths of the feedback edges, meaning the sum of j — i over edges Uj V; 
such that j > i. Prove that every ordering minimizing f(a) places the vertices in non- 
increasing order of outdegree. (Hint: Determine how f (co) changes when consecutive 
elements of o are exchanged.) (Kano—Sakamoto [1983], Isaak—-Tesman [1991]) 


Chapter 2 


Trees and Distance 


2.1. Basic Properties 


The word “tree” suggests branching out from a root and never completing 
a cycle. Trees as graphs have many applications, especially in data storage, 
searching, and communication. 


2.1.1. Definition. A graph with no cycle is acyclic. A forest is an acyclic 
graph. A tree is a connected acyclic graph. A leaf (or pendant vertex) 
is a vertex of degree 1. A spanning subgraph of G is a subgraph with 
vertex set V(G). A spanning tree is a spanning subgraph that is a tree. 


JINN 1 AYN ATS 


2.1.2. Example. A tree is a penniecesdtoret: and every component of a forest 
is a tree. A graph with no cycles has no odd cycles; hence trees and forests are 
bipartite. 

Paths are trees. A tree is a path ifand only if its maximum degree is 2. A 
star is a tree consisting of one vertex adjacent to all the others. The n-vertex 
star is the biclique K,,,-1. 

A graph that is a tree has exactly one spanning tree; the full graph itself. 
A spanning subgraph of G need not be connected, and a connected subgraph of 
G need not be a spanning subgraph. For example: 

If n(G) > 1, then the empty subgraph with vertex set V(G) and edge set @ 
is spanning but not connected. 

If n(G) > 2, then a subgraph consisting of one edge and its endpoints 1 is 
connected but not spanning. a 
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PROPERTIES OF TREES 


Trees have many equivalent characterizations, any of which could be taken 
as the definition. Such characterizations are useful because we need only verify 
that a graph satisfies any one of them to prove that it is a tree, after which we 
can use all the other properties. 

We first prove that deleting a leaf from 2 tree yields a smaller tree. 


2.1.3. Lemma. Every tree with at least two vertices has at least two leaves. 
Deleting a leaf from an n-vertex tree produces a tree with n — 1 vertices. 


Proof: A connected graph witb at least two vertices has an edge. In an acyclic 
graph, an endpoint of a maximal nontrivial path has no neighbor other than its 
neighbor on the path. Hence the endpoints of a such a path are leaves. 

Let v be a leaf of a tree G, and let G’ = G — v. A vertex of degree 1 belongs 
to no path connecting two other vertices. Therefore, for u,w € V(G’), every 
u, w-path in G is also in G’. Hence G’ is connected. Since deleting a vertex 
cannot create a cycle, G’ also is acyclic. Thus G’ is a tree with n — 1 vertices. & 
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Lemma 2.1.3 implies that every tree with more than one vertex arises from 
a smaller tree by adding a vertex of degree 1 (all our graphs are finite). This 
rescues some proofs from the induction trap: growing an n + 1-vertex tree from 
an arbitrary n-vertex tree by adding a new neighbor at an arbitrary old vertex 
generates all trees with n + 1 vertices. The word “arbitrary” means that the 
discussion considers all ways of making the choice. 

Our proof of equivalence of characterizations of trees uses induction, prior 
results, a counting argument, extremality, and contradiction. 


2.1.4. Theorem. For an n-vertex graph G (with n > 1), the following are equiv- 
alent (and characterize the trees with n vertices). 
A) G is connected and has no cycles. 
B) G is connected and has n — 1 edges. 
C) G has n — 1 edges and no cycles. 
D) For u, v € V(G), G has exactly one u, v-path. 


Proof: We first demonstrate the equivalence of A, B, and C by proving that any 
two of {connected, acyclic, n — 1 edges} together imply the third. 

A => {B, C}. We use induction on n. For n = 1, an acyclic 1-vertex graph has 
no edge. Forn > 1, we suppose that the implication holds for graphs with fewer 
than » vertices. Given an acyclic connected graph G, Lemma 2.1.3 provides a 
leaf v and states that G’ = G —v also is acyclic and connected (see figure above). 
Applying the inducticn hypothesis to G’ yields e(G’) = n — 2. Since only one 
edge is incident to v, we have e(G) =n — 1. 

B = {A, C}. Delete edges from cycles of G one by one until the resulting 
graph G’ is acyclic. Since no edge of a cycle is a cut-edge (Theorem 1.2.14), G’ is 
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connected. Now the preceding paragraph implies that e(G’) = n — 1. Since we 
are given e(G) = n — 1, no edges were deleted. Thus G’ = G, and G is acyclic. . 

C = {A, B}. Let Gi,...,G, be the components of G. Since every vertex 
appears in one component, }°,n(G;) = n. Since G has no cycles, each com- 
ponent satisfies property A. Thus e(G;) = n(G;) — 1. Summing over i yields 
e(G) = >_,[a(G;) —- 1] =n —k. We are given e(G) = n —.1, sok = 1, and Gis 
connected. 

A = D. Since G is connected, each pair of vertices is connected by a path. 
If some pair is connected by more than one, we choose a shortest (total length) 
pair P, Q of distinct paths with the same endpoints. By this extremal choice, 
no internal vertex of P or Q can belong to the other path (see figure below). 
This implies that P U Q is a cycle, which contradicts the hypothesis A. 

D > A. If there is a uv, v-path for every u, v € V(G), then G is connected. If 
G has a cycle C, then G has two u, v-paths for u,v € V(C); hence G is acyclic 
(this also forbids loops). | 
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2.1.5. Corollary. a) Every edge of a tree is a cut-edge. 


b) Adding one edge to a tree forms exactly one cycle. 
c) Every connected graph contains‘a spanning tree. 


Proof: (a) A tree has no cycles, so Theorem 1.2.14 implies that every edge is a 
cut-edge. (b) A tree has a unique path linking each pair of vertices (Theorem 
2.1.4D), so joining two vertices by an edge creates exactly one cycle. (c) As in 
the proof of B>A,C in Theorem 2.1.4, iteratively deleting edges from cycles in 
a connected graph yields a connected acyclic subgraph. | 


We apply Corollary 2.1.5 to prove two results about pairs of spanning trees. 
We use subtraction and addition to indicate deletion and inclusion of edges. 


2.1.6. Proposition. If 7, T’ are spanning trees of a connected graph G and e € 
E(T) — E(T’), then there is an edge e’ € E(T’) — E(T) such that T —e+e’ 
is a spanning tree of G. 
Proof: By Corollary 2.1.5a, every edge of T is a cut-edge of T. Let U and U’ 
be the two components of T — e. Since T’ is connected, T’ has an edge e’ with 
endpoints in U and U’. Now T —e +’ is connected, has n(G) — 1 edges, and is 
a spanning tree of G. 
(In the figure below, T is bold, 7’ is solid, and they share two edges.) | 
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2.1.7. Proposition. If T, T’ are spanning trees of a connected graph G and e € 
E(T) — E(T’), then there is an edge e’ € E(T’) — E(T) such that T’ +e — e’ 
is a spanning tree of G. 


Proof: By Corollary 2.1.5b, The graph 7’ + e contains a unique cycle C. Since 
T is acyclic, there is an edge e’ € E(C) — E(T). Deleting e’ breaks the only cycle 
in T’ + e. Now T’ + e — e’ is connected and acyclic and is a spanning tree of G. 

(In the figure above, adding e to T creates a cycle C of length five; all four 
edges of C — e belong to E(T) — E(T’) and can serve as e’.) | 


The edge e’ can be chosen to satisfy the conclusions of Propositions 2.1.6— 
2.1.7 simultaneously, as illustrated in the figure between them (Exercise 37). 
The next result illustrates proof by induction using deletion of a leaf. 


2.1.8. Proposition. If T is a tree with k edges and G is a simple graph with 
6(G) > k, then T is a subgraph of G. 


Proof: We use induction on k. Basis step: k = 0. Every simple graph contains 
K,, which is the only tree with no edges. 

Induction step: k > 0. We assume that the claim holds for trees with fewer 
than k edges. Since k > 0, Lemma 2.1.3 allows us to choose a leaf v in T; let u be 
its neighbor. Consider the smaller tree T ‘= T —v. By the induction hypothesis, 
G contains T’ as a subgraph, since 5(G) >k > k — 1. 

Let x be the vertex in this copy of 7’ that corresponds to u (see illustration). 
Because 7’ has only k — 1 vertices other than u and dg(x) > k, x has a neighbor 
y in G that is not in this copy of T’. Adding the edge xy expands this copy of T’ 
into a copy of T in G, with y playing the role of v. | 
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The inequality of Proposition 2.1.8 is sharp; the graph K;, has minimum 
degree k — 1, but it contains no tree with k edges. The proposition implies 
that every n-vertex simple graph G with more than n(k — 1) edges has T as 
a subgraph (Exercise 34). Erdés and Sods conjectured the stronger statement 
that e(G) > n(k — 1)/2 forces T as a subgraph (Erdés [1964]). This has been 
proved for graphs without 4-cycles (Saclé—Wozniak [1997]). Ajtai, Komlés, and 
Szemerédi proved an asymptotic version, as reported in Soffer [2000]. 


DISTANCE IN TREES AND GRAPHS 


When using graphs to model communication networks, we want vertices to 
be close together to avoid communication delays. We measure distance using 
lengths of paths. 
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2.1.9. Definition. If G has a u, v-path, then the distance from u to v, written 
dg(u, v) or simply d(u, v), is the least length of a u, v-path. If G has no such 
path, then d(u, v) = oo. The diameter (diam G) is max,, ,eyg) d(u, v). 

The eccentricity of a vertex u, written €(u), is Maxyey(G) d(u, v). The 
radius of a graph G, written rad G, is min,ey(g) €(u). 


The diameter equals the maximum of the vertex eccentricities. In a dis- 

connected graph, the diameter and radius (and every eccentricity) are infinite, 
because distance between vertices in different components is infinite. We use 
the word “diameter” due to its use in geometry, where it is the greatest distance 
between two elements of a set. 
2.1.10. Example. The Petersen graph has diameter 2, since nonadjacent ver- 
tices have a common neighbor. The hypercube Q, has diameter k, since it takes 
k steps to change all k coordinates. The cycle C,, has diameter |n/2|. In each of 
these, every vertex has the same eccentricity, and diam G = rad G. 

For n > 3, the n-vertex tree of least diameter is the star, with diameter 2 
and radius 1. The one of largest diameter is the path, with diameter n — 1 and 
radius [(n — 1)/2]. Every path in a tree is the shortest (the only!) path between 
its endpoints, so the diameter of a treeis the length of its longest path. 

In the graph below, each vertex is labeled with its eccentricity. The radius 
is 2, the diameter is 4, and the length of the longest path is 7. a 
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To have large diameter, many edges must be missing. Thus we expect 
the complement of a graph with large diameter to have small diameter. We 
use the simple observation that a graph has diameter at most 2 if and only if 
nonadjacent vertices always have common neighbors (see also Exercise 15). 


2.1.11. Theorem. If G is a simple graph, then diam G > 3 > diam G < 3. 


Proof: When diam G > 2, there exist nusiadjacent vertices u, v € V(G) with no 
common neighbor. Hence every x « V(G) — {u, v} has at least one of {u, v} asa 
nonneighbor. This makes x adjacent in G to at least one of {u, v} in G. Since 
also uv € E(G), for every pair x, y there is an x, y-path of length at most 3 in G 
through {u, v}. Hence diam G < 3. r] 
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2.1.12. Definition. The center of a graph G is the subgraph induced by the 
vertices of minimum eccentricity. 


The center of a graph is the full graph if and only if the radius and diameter 
are equal. We next describe the centers of trees. In the induction step, we delete 
all leaves instead of just one. 


2.1.13. Theorem. (Jordan [1869]) The center of a tree is a vertex or an edge. 


Proef: We use induction on the number of vertices in a tree T. 

Basis step: n(T) < 2. With at most two vertices, the center is the entire tree. 

Induction step: n(T) > 2. Form T’ by deleting every leaf of T. By Lemma 
2.1.3, T’ is a tree. Since the internal vertices on paths between leaves of T 
remain, 7’ has at least one vertex. 

Every vertex at maximum distance in T from a vertex u € V(T) is a leaf 
(otherwise, the path reaching it from u can be extended farther). Since all 
the leaves have been removed and no path between two other vertices uses 
a leaf, ep (u) = €7(u) — 1 for every u € V(T’). Also, the eccentricity of a leaf 
in T is greater than the eccentricity of its neighbor in 7. Hence the vertices 
minimizing €7(u) are the same as the vertices minimizing €7(u). 

We have shown that T and T’ have the same center. By the induction 
hypothesis, the center of T’ is a vertex or an edge. | 


Q2 


In a communication network, large diameter may be acceptable if most 
pairs can communicate via short paths. This leads us to study the average 
distance instead of the maximum. Since the average is the sum divided by (5) 
(the number of vertex pairs), it is equivalent to study D(G) = eu.veV(G) dgtu, v). 

The sum D(G) has been called the Wiener index of G (also written W(G)). 
Wiener used it to study the boiling point of paraffin. Molecules can be modeled 
by graphs with vertices for atoms and edges for atomic bonds. Many chemical 
properties of molecules are related to the Wiener index of the corresponding 
graphs. We study the extreme values of D(G). 
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2.1.14. Theorem. Among trees with n vertices, the Wiener index D(T) = 
>.» 4, v) is minimized by stars and maximized by paths, both uniquely. 


Proof: Since a tree has n — 1 edges, it has n — 1 pairs of vertices at distance 
1, and all other pairs have distance at least 2. The star achieves this and 
hence minimizes D(T). To show that no other tree achieves this, consider a 
leaf x in T, and let v be its neighbor. If all other vertices have distance 2 
from'x, then they must be neighbors of v, and T is the star. The value is 
D(Kin—1) = (n— 1) +2("5°) = (n - 1). 
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For the maximization, consider first D(P,,). This equals the sum of the 
distances from an endpoint u to the other vertices, plus D(P,_1). We have 
Dever, du, v) = Yip i = (5). Thus D(P,) = D(P,-1) + (5). With Pascal’s 
Formula (") + (,”,) = ("{°) (see Appendix A), induction yields D(P,) = eas 
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We prove by induction on n that among n-vertex tree, P,, is the only tree 
that maximizes D(T). 

Basis step: n = 1. The only tree with one vertex is P}. 

Induction step: n > 1. Let u be a leaf of an n-vertex tree T. Now D(T) = 
D(T — u) + diyevcry dtu, v). By the induction hypothesis, D(T — u) < D(Pn-1), 
with equality if and only if T —u is a path. Thus it suffices to show that 
veviry d(u, v) is maximized only when T is a path and u is an endpoint of T. 

Consider the list of distances from u. In P,, this list is 1,2,...,n —1, all 
distinct. A shortest path from u to a vertex farthest from u contains vertices at 
all distances from u, so in any tree the set of distances from u to other vertices 
has no gaps. Thus any repetition makes }°,.y,;7) d(u, v) smaller than when u is 
a leaf of a path. When T is not a path, such a repetition occurs. | 


Over all connected n-vertex graphs, D(G) is minimized by K,. The maxi- 
mization problem reduces to what we have already done with trees. 


2.1.15. Lemma. If H is a subgraph of G, then dg(u, v) < dy(u, v). 


Proof: Every u, v-path in H appears also in G, so the shortest u, v-path in G is 
no longer than the shortest u, v-path in H. r 


2.1.16. Corollary. If G is a connected n-vertex graph, then D(G) < D(P,). 


Proof: Let T be a spanning tree of G. By Lemma 2.1.15, D(G) < D(T). By 
Theorem 2.1.14, D(T) < D(P,). a 


DISJOINT SPANNING TREES (optional) 


We have seen that every connected graph has a spanning tree. Edge- 
disjoint spanning trees provide alternate routes when an edge in the primary 
tree fails. Tutte [1961a] and Nash-Williams [1961] independently character- 
ized graphs having k pairwise edge-disjoint spanning trees (see Exercise 67). 

We describe one application of edge-disjoint spanning trees. David Gale 
devised a game marketed under the name “Bridg-it” (copyright 1960 by Has- 
senfeld Bros., Inc.—“Hasbro Toys”), Each of two players owns a rectangular 
grid of posts. The players move alternately, at each move joining two of their 
own posts by a unit-length bridge. The figure on the left below illustrates the 
board; Player 1’s posts aré solid, and Player 2’s are hollow. The object of Player 
1 is to construct a path of bridges from the left column to the right column; 
Player 2 wants a path of bridges from the top row to the bottom row. 
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Bridges cannot cross. Therefore, every bridge that is played eliminates a 
potential move for the other player. Since every path from left to right crosses 
every path from top to bottom, the players cannot both win. Note also that the 
layout of the board is symmetric in the two players. 

We argue that Player 2 cannot have a winning strategy. Suppose other- 
wise. Because the board is symmetric, Player 1 can start with any move and 
then followthe strategy of Player 2, making 2n arbitrary move if the strategy 
of Player 2 ever calls for a bridge that has already been played. Before Player 
2 can win, Player 1 wins by using the saine strategy. 

If the game is played until no further moves are possible, then some player 
must have won (Exercise 70). Since Player 2 kas no winning strategy, this 
implies that Player 1 has a winning strategy. Here we give an explicit strat- 
egythat Player 1 can use to win. (The argument holds more generally in the 
contextof “matroids”—see Theorem 8.2.46.) 
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2.1.17, Theorem. Player 1 has a winning strategy in Bridg-:t. 


Proof: We form a graph of the potential connections for Player 1. Posts on the 
same end are equivalent, so we collect the (solid) posts from the end columns 
as single vertices. We add an auxiliary edge between the ends. The figure 
on the right above illustrates that this graph is the union of two edge-disjoint 
spanning trees; we omit a technical description of the two trees. 

Together, the two trees contain edge-disjoint paths between the goal ver- 
tices. Since the auxiliary edge doesn’t really exist, we pretend Piayer 2 moved 
first and took that edge. A move by Player 2 cuts une edge e in the graph and 
makes it no longer available. This cuts one of the trees into two components. 
By Proposition 2.1.6, some edge e’ from the other tree reconnects it. 

‘Player 1 chooses such an edge e’. This makes e’ uncuttable, in effect putting 
e’ in both spanning trees. After deleting e and making e’ a double edge with one 
copy in each tree, our graph still consists of two edge-disjoint spanning trees. 
Since Player 2 cannot cut a double edge, Player 2 cannot cut both trees. Thus 
Player 1 can always defend. The figure beiow illustrates the strategy. 

The process stops when Player 1 has won or when no single edges remain 
to be cut. In the latter case the remaining edges are double edges and form a 
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spanning tree of bridges built by Player 1. hus in either case Player 1 has 
constructed a path connecting the special vertices. | 


Player 1 reconnects Player 2 cuts Player 1 reconnects 


EXERCISES 


2.1.1. (—) For eacn k, list the isomorphism ciasses of trees with maximum degree k ard 
at most six vertices. Do the same for diameter k. (Explain why there are no others.) 


2.1.2. (—) Let G be a graph. 
a) Prove that G is a tree if and only if G is connected and every edge is a cut-edge. 
b) Prove that G is a tree if and only if adding any edge with endpoints in V(G) 

creates exactly one cycle. 


2.1.3. (—) Prove that a graph is a tree ifé 
spanning tree. 


hd only if it is loopless and has exactly one 
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2.1.4. (—) Prove or disprove: Every graph v with fewer edges than vertices has a compo- 
nent that is a tree. 2 


a 


2.1.5. (—) Let G be a graph. Prove that a maximal acyclic subgraph of G consists of a 
spar.ning tree from each component of G. 


2.1.6. (—) Let T be a tree with average degree a. In terms of a, determine n(T). 
2.1.7. (—) Prove that every n-vertex graph with m edges has at least m — n + i cycles. 


2.1.8. (—) Prove that each property below characterizes forests. 
a) Every induced subgraph has a vertex of degree at most 1. 
b) Every connected subgraph is an induced subgraph. 
c) The number of components is the number of vertices minus the number of edges. 


2.1.9. (—) For 2 < k <n-—1, prove that the n-vertex graph formed by adding one vertex 
adjacent to every vertex of P,_; has a spanning tree with diameter k. 


2.1.10. (—) Let u and v be vertices in a connected n-vertex simple graph. Prove that if 
d(u, v) > 2, then d(u) + d(v) < n+1-—d(u, v). Construct examples to show that this can 
fail whenever n > 3 and d(u, v) < 2. 


2.1.11. (—) Let x and y be adjacent vertices in a graph G. For all z € V(G), prove that 
ldg(x, Zz) _ dg y, z)| < 1. 


2.1.12. (—) Compute the diameter and radius of the biclique Ky, n. 


2.1.13. (—) Prove that every graph with diameter d has an tbeependent set with at least 
[(1 + d)/2] vertices. 
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2.1.14. (—) Suppose that the processors in a computer are named by binary k-tuples, 
and pairs can communicate directly if and only if their names are adjacent in the k- 
dimensional cube Q,. A processor with name u wants to send a message to the processor 
with name v. How can it find the first step on a shortest path to v? 


2.1.15. (—) Let G be asimple graph with diameter at least 4. Prove that G has diameter 
at most 2. (Hint: Use Theorem 2.1.11.) 


2.1.16. (—) Given a simple graph G, define G’ to bc the simple graph on the same vertex 
set such that xy € E(G’) if and only if x and y are adjacent in G or have a common 
neighbor in G. Prove that diam (G’) = [diam (G)/2]. 


2.1.17. (!) Prove C => {A, B} in Theorem 2.1.4 by adding edges to connect components. 


2.1.18. (!) Prove that every tree with maximum degree A > 1 has at least A vertices of 
degree 1. Show that this is best possible by constructing an n-vertex tree with exactly 
A leaves, for each choice of n, A with n > A > 2. 


2.1.19. Prove or disprove: If n; denotes the number of vertices of degree i in a tree T, 
then y in; depends only on the number of vertices in 7. 


2.1.20. A saturated hydrocarbon is a molecule formed from k carbon atoms and / hydro- 
gen atoms by adding bonds between atoms such that each carbon atom is in four bonds, 
each hydrogen atom is in one bond, and no sequence of bonds forms a cycle of atoms. 
Prove that | = 2k + 2. (Bondy—Murty [1976, p27]) 


2.1.21. Let G be an n-vertex simple graph having a decomposition into k spanning trees. 
Suppose also that A(G) = 5(G) + 1. For 2k = n, show that this is impossible. For 2k < n, 
determine the degree sequence of G in ternis of n and k. 


2.1.22. Let T be an n-vertex tree having one vertex of each degree i with 2 <i <k; the 
remaining n — k + 1 vertices are leaves. Determine n in terms of k. 


2.1.28. Let T be a tree in which every vertex has degree 1 or degree k. Determine the 
possible values of n(T). 


2.1.24. Prove that every nontrivial tree has at least two maximal independent sets, 
with equality only for stars. (Note: maximal #4 maximum.) 


2.1.25. Prove that among trees with n vertices, the star has the most independent sets. 


2.1.26. (!) For n > 3, let G be an n-vertex graph such that every graph obtained by 
deleting one vertex is a tree. Determine e(G), and use this to determine G itself. 


2.1.27. (!) Let d,,...,d, be positive integers, with n > 2. Prove that there exists a tree 
with vertex degrees d},...,d, if and only if 5) d; = 2n — 2. 


2.1.28. Let d, > --- > d, be nonnegative integers. Prove that there exists a connected 
graph (loops and multiple edges allowed) with degree sequence d,...,d, if and only if 
>> d; is even, d, > 1, and )\ d; > 2n — 2. (Hint: Consider a realization with the fewest 
components.) Is the statement true for simple graphs? 


2.1.23. (!) Every tree is bipartite. Prove that every tree has a leaf in its larger partite 
set (in both if they have equal size). 


2.1.30. Let T be a tree in which all vertices adjacent to leaves have degree at least 3. 
Prove that T has some pair of leaves with a common neighbor. 
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2.1.31. Prove that a simple connected graph having exactly two vertices that are not 
cut-vertices is a path. 


2.1.32. Prove that an edge e of a connected graph G is a cut-edge if and only if e belongs 
to every spanning tree. Prove that ¢ is a loop if and only if e belongs to no spanning tree. 


2.1.33. (!) Let G be a connected n-vertex graph. Prove that G has exactly one cycle if 
and only if G has exactly n edges. 


2.1.34. (!) Let T be a tree with k edges, and let G be a simple n-vertex graph with more 
than n(k — 1) — (5) edges. Use Proposition 2.1.8 to prove that T € G ifn > k. 


2.1.35. (!) Let T be a tree. Prove that the vertices of T all have odd degree if and only 
if for all e € E(T), both components of T — e have odd order. 


2.1.36. (!) Let T be a tree of even order. Prove that T has exactly one spanning subgraph 
in which every vertex has odd degree. 


2.1.37. (!) Let 7, T’ be two spanning trees of a connected graph G. Fore € E(T)— E(T’), 
prove that there is an edge e’ € E(T’) — E(T) such that T’ +e —e’ and T —e+e’ are both 
spanning trees of G. 


2.1.38. Let T, T’ be two trees on the same vertex set such that d;(v) = d;(v) for each 
vertex v. Prove that T’ can be obtained from T’ using 2-switches (Definition 1.3.32) so 
that every graph along the way is also a tree. (Kelmans [1998]) 


2.1.39. (!) Let G be a tree with 2k vertices of odd degree. Prove that G decomposes into 
k paths. (Hint: Prove the stronger result that the claim holds for all forests.) 


2.1.40. (!) Let G be a tree with k leaves. Prove that G is the union of paths P;,..., Préy2] 
such that P;} 1 P; 4 @ for alli # j. (Ando-Kaneko—Gervacio [1996]) 


2.1.41. Forn > 4, let G be a simple n-vertex graph with e(G) > 2n —3. Prove that G has 
two cycles of equal length. (Chen—Jacobson—Lehel—Shreve [1999] strengthens this.) 


2.1.42. Let G be a connected Eulerian graph with at least three vertices. A vertex v in 
G is extendible if every trail beginning at v can be extended to form an Eulerian circuit 
of G. For example, in the graphs below only the marked vertices are extendible. Prove 
the following statements about G (adapted from Chartrand—Lesniak [1986, p61]). 

a) A vertex v € V(G) is extendible if and only if G — v is a forest. (Ore [1951]) 

b) If v is extendible, then d(v) = A(G). (Babler [1953]) 

c) All vertices of G are extendible if and only if G is a cycle. 

d) If G is not a cycle, then G has at most two extendible vertices. 


2.1.43. Let u be a vertex in a connected graph G. Prove that it is possible to select 
shortest paths from u to all other vertices of G so that the union of the paths is a tree. 


2.1.44. (!) Prove or disprove: If a simple graph with diameter 2 has a cut-vertex, then 
its complement has an isolated vertex. 


2.1.45. Let G be a graph having spanning trees with diameter 2 and diameter /. For 
2 <k <1, prove that G also has a spanning tree with diameter k. (Galvin) 
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2.1.46. (!) Prove that the trees with diameter 3 are the double-stars (two central 
vertices plus leaves). Count the isomorphism classes of double-stars with n vertices. 


2.1.47. (!) Diameter and radius. 

a) Prove that the distance function d(u, v) on pairs of vertices of a graph satisfies 
the triangle inequality: d(u, v) + d(v, w) = d(u, w). 

b) Use part (a) to prove that diam G < 2rad G for every graph G: 


c) For all positive integers r and d that satisfy r < d < 2r, construct a simple graph 
with radius r and diameter d. (Hint: Build a suitable graph with one cycle.) 


2.1.48. (!) For n > 4, prove that the minimum number of edges in an n-vertex graph 
with diameter 2 and maximum degree n — 2 is 2n — 4. 


2.1.49. Let G be a simple graph. Prove that rad G > 3 > radG < 2. 


2.1.50. Radius and eccentricity. 

a) Prove that the eccentricities of adjacent vertices differ by at most 1. 

b) In terms of the radius r, determine the maximum possible distance from a vertex 
of eccentricity r + 1 to the center of G. (Hint: Use a graph with one-cycle.) 


2.1.51. Let x and y be distinct neighbors of a vertex v in a graph G. 
a) Prove that if G is a tree, then 2e(v)_< €(x) + €(y). 
b) Determine the smallest graph where this inequality can fail. 


2.1.52. Let x be a vertex in a graph G, and- suppose that «(x) > rad G. 

a) Prove that if G is a trée, then x has-a neighbor with eccentricity k — 1. 

b) Show that part (a) does not hold for all graphs by constructing, for each even r 
that is at least 4, a graph with radius r in which x has eccentricity r + 2 and has no 
neighbor with eccentricity r + 1. (Hint: Use a graph with one cycle.) 


2.1.53. Prove that the center of a graph can be disconnected and can have components 
arbitrarily far apart by constructing a graph where the center consists of two vertices 
and the distance between these two vertices is k. 


2.1.54. Centers of trees. Let T be a tree. 

a) Give a noninductive proof that the center of T is a vertex or an edge. 

b) Prove that the center of T is one vertex if and only if diam T = 2rad T. 

c) Use part (a) to prove that if n(T) is odd, then every automorphism of T maps 
some vertex to itself. 


2.1.55. Given x € V(G), let s(x) = ) yg) d(x, v). The barycenter of G is the sub- 
graph induced by the set of vertices minimizing s(x) (the set is also called the median). 
a) Prove that the barycenter of a tree is a single vertex or an edge. (Hint: Study 
s(u) — s(v) when u and v are adjacent.) (Jordan [1869]) 
b) Determine the maximum distance between the center and the barycenter in a 
tree of diameter d. (Example: in the tree below, the center is the edge xy, the barycenter 
contains only z, and the distance between them is 1.) 
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2.1.56. Let T be a tree. Prove that T has a vertex v such that for all e €¢ E(T), the 
component of T — e containing v has at least [n(T)/2] vertices. Prove that either v is 
unique or there are just two adjacent such vertices. 


2.1.57. Let ni,...,, be positive integers with sum n — 1. 

a) By counting edges in complete graphs, prove that ae ("!) < (‘): 

b) Use part (a) to prove that >>, evr) 44, v) < (5) when u is a vertex of a tree 7. 
(Hint: Use strong induction on the number of vertices.) 


2.1.58. (+) Let S and T be trees with leaves {x,,..., x,} and {y,,..., yz}, respectively. 
Suppose that ds(x;, xj) = dr(yi, y;) for each pair i, j..Prove that S and T are isomorphic. 
(Smolenskii [1962]) 


2.1.59. (!) Let G be a tree with n vertices, k leaves, and maximum degree k. 
a) Prove that G is the union of k paths with a common endpoint. 
b) Determine the maximum and minimum possible values of diam G. 


2.1.60. Let G be a graph with diameter d and maximum degree k. Prove that n(G) < 
1+ [(k — 1)% — 1]k/(k — 2). (Comment: Equality holds for the Petersen graph.) 


2.1.61. (+) Let G be a graph with smallest order among k-regular graphs with girth 
at least g (Exercise 1.3.16 establishes the existence of such graphs). Prove that G has 
diameter at most g. (Hint: If d¢(x, y) > g, modify G to obtain a smaller k-regular graph 
with girth at least g.) (Erdés—Sachs [1963]) 


2.1.62. (!) Let G be a connected graph with n vertices. Define a new graph G’ having 
one vertex for each spanning tree of G, with vertices adjacent in G’ if and only if the 
corresponding trees have exactly n(G) — 2:;common edges. Prove that G’ is connected. 
Determine the diameter of G’. An example‘appears below. 


2.1.63. (!) Prove that every n-vertex graph with n + 1 edges has girth at most 
[(2n + 2)/3]. For each n, construct an example achieving this bound. 


2.1.64. (!) Let G be a connected graph that is not a tree. Prove that some cycle in G 
has length at most 2(diam G) + 1. For each k € N, show that this is best possible by 
exhibiting a graph with diameter k and girth 2k + 1. 


2.1.65. (+) Let G be a connected n-vertex graph with minimum degree k, where k > 2 
and n — 2 > 2(k +1). Prove that diam G < 3(n — 2)/(k + 1) — 1. Whenever k > 2 and 
(n — 2)/(K + 1) is an integer greater than 1, construct a graph where the bound holds 
with equality. (Moon [1965b]) 


2.1.66. Let Fi,..., Fn be forests whose union is G. Prove that m > maxycg | <2 | 


(Comment: Nash-Williams [1964] and Edmonds [1965b] proved that this bound is al- 
ways achieveable—Corollary 8.2.57). 
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2.1.67. Prove that the following is a necessary condition for the existence of k pairwise 
edge-disjoint spanning trees in G: for every partition of the vertices of G into r sets, 
there are at least k(r — 1) edges of G whose endpoints are in different sets of the partition. 
(Comment: Corollary 8.2.59 shows that this condition is also sufficient - Tutte [1961al], 
Nash-Williams [1961], Edmonds [1965c].) 


2.1.68. Can the graph below be decomposed into edge-disjoint spanning trees? Into 
isomorphic edge-disjoint spanning trees? 


2.1.69. (*) Consider the graph before Theorem 2.1.17 with 12 vertical edges and 16 
edges that are horizontal or slanted. Let g;,; be the ith edge from the top in the jth 
column of vertical edges. Let h;,; be the jth edge from the left in the ith row of horizon- 
tal/diagonal edges. Suppose that Player 1 follows the strategy of Theorem 2.1.17 and 
first takes h,,,. Player 2 deletes go2, and Player 1 takes h23. Next Player 2 deletes v3 2, 
and Player 1 takes h42. Draw the two spanning trees at this point. Given that Player 2 
next deletes go 3, list all moves available to Player 1 within the strategy. (Pritikin) 


2.1.70. (*) Prove that Bridg-it cannot end.in a tie no matter how the moves are made. 
That is, prove that when no further moves can be made, one of the players must have 
built a path connecting his/her goals. s 


2.1.71. (*) The players change the rules of Bridg-it so that a player with path between 
friendly ends is the /oser. It is forbidden to stall by building a bridge joining end posts or 
joining posts already connected by a path. Show that Player 2 has a strategy that forces 
Player 1 to lose. (Hint: Use Proposition 2.1.7 instead of Proposition 2.1.6.) (Pritikin) 


2.1.72. (+) Prove that if G,,..., G, are pairwise-intersecting subtrees of a tree G, then 
G has a vertex that belongs to all of G;,..., G,. (Hint: Use induction on k. Comment: 
This result is the Helly property for trees.) 


2.1.73. (+) Prove that a simple graph G is a forest if and only if for every pairwise in- 
tersecting family of paths in G, the paths have a common vertex. (Hint: For sufficiency, 
use induction on the size of the family of paths.) 


2.1.74. Let G be asimple n-vertex graph having n —2 edges. Prove that G has an isolated 
vertex or has two components that are nontrivial trees. Use this to prove inductively 
that G is a subgraph of G. (Comment: The claim is not true for all graphs with n — 1 
edges.) (Burns—Schuster [1977]) 


2.1.75. (+) Prove that every n-vertex tree other than K,,,-1 is contained in its comple- 
ment. (Hint: Use induction on 7 to prove a stronger statement: if T is an n-vertex tree 
other than a star, then K, contains two edge-disjoint copies of T in which the two copies 
of each non-leaf vertex of T appear at distinct vertices.) (Burns—Schuster [1978]) 


2.1.76. (+) Let S be an n-element set, and let {A;,...,A,} be n distinct subsets of S. 
Prove that S has an element x such that the sets A, U{x}, ..., A, U{x} are distinct. (Hint: 
Define a graph with vertices a;,...,a, such that a; < a; if and only if one of {A;, A;} 
is obtained from the other by adding a single element y. Use y as a label on the edge. 
Prove that there is a forest consisting of one edge with each label used. Use this to 
obtain the desired x.) (Bondy [1972a]) 
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2.2. Spanning Trees and Enumeration 


There are 2°) simple graphs with vertex set [n] = {1,..., n}, since each pair 
may or may not form an edge. How many of these are trees? In this section, 
we solve this counting problem, count spanning trees in arbitrary graphs, and 
discuss several applications. 


ENUMERATION OF TREES 


With one or two vertices, only one tree can be formed. With three vertices 
there is still only one isomorphism class, but the adjacency matrix is deter- 
mined by which vertex is the center. Thus there are three trees with vertex set 
[3]. With vertex set [4], there are four stars and 12 paths, yielding 16 trees. 
With vertex set [5], a careful study yields 125 trees. 
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Now we may see a pattern. With vertex set [n], there are n"~? trees; this is 
Cayley’s Formula. Priifer, Kirchhoff, Pélya, Renyi, and others found proofs. 
J.W. Moon [1970] wrote a book about enumerating classes of trees. We present 
a bijective proof, establishing a one-to-one correspondence between the set of 
trees with vertex set [n] and a set of known size. 

Given a set S of n numbers, there are exactly n"~* ways to form a list of 
length n — 2 with entries in S. The set of lists is denoted S"~? (see Appendix A). 
We use S”"~? to encode the trees with vertex set S. The list that results from a 
tree is its Priifer code. 


2.2.1. Algorithm. (Priifer code) Production of f(T) = (a1,..., Gy_2). 

Input: A tree T with vertex set § CN. 

Iteration: At the ith step, delete the least remaining leaf, and let a; be the 
neighbor of this leaf. | 


2.2.2. Example. After n — 2 iterations, only one of the original n — 1 edges 
remains, and we have produced a list f(T) of length n — 2 with entries in S. In 
the tree below, the least leaf is 2; we delete it and record 7. After deleting 3 and 
5 and recording 4 each time, the least leaf in the remaining 5-vertex tree is 4. 
The full code is (744171), and the vertices remaining at the end are 1 and 8. 
After the first step, the remainder of the Priifer code is the Priifer code of the 
subtree T’ with vertex set [8] — {2}. 
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If we know the vertex set S, then we can retrieve the tree from the code a. 
The idea is to retrieve all the edges. We start with the set S of isolated vertices. 
At each step we create one edge and mark one vertex. When we are ready to 
consider a;, there remain n —i + 1 unmarked vertices and n —i — 1 entries of a 
(including a;). Thus at least two of the unmarked vertices do not appear among 
the remaining entries of a. Let x be the least of these, add xa; to the list of 
edges, and mark x. After repeating this n — 2 times, two unmarked vertices 
remain; we join them to form the final edge. 

In the example above, the least element of S not in the code is 2, so the first 
edge added joins 2 and 7, and we mark 2. Now the least unmarked element 
absent from the rest is 3, and we join it to -4, which is ag. As we continue, we 
reconstruct edges in the order they were deleted to obtain a from T. 

Throughout the process, each component of the graph we have grown has 
one unmarked vertex. This is true initially, and thus adding an edge with two 
unmarked endpoints combines two components. After marking one vertex of 
the new edge, again each component has one unmarked vertex. After n — 2 
steps, we have two unmarked vertices and therefore two components. Adding 
the last edge yields a connected graph: We have built a connected graph with 
n vertices and n — 1 edges. By a 2.1.4B, it is a tree, but we have not yet 
proved that its Prtifer code is a. | 
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2.2.3. Theorem. (Cayley’s Formula [1889]). For a set S C N of size n, there are 
n"~2 trees with vertex set S. 


Proof: (Priifer [1918]). This holds for n = 1, so we assume n > 2. We prove 
that Algorithm 2.2.1 defines a bijection f. from the set of trees with vertex 
set S to the set S”~? of lists of length n — 2 from S. We must show for each 
a = (dj,...,4,-2) € S"~* that exactly one tree T with vertex set S satisfies 
f(T) =a. We prove this by induction on n. 

Basis step: n = 2. There is tree with two vertices. The Priifer code is a list 
of length 0, and it is the only such list. 

Induction step: n > 2. Computing f(T) reduces each vertex to degree 1 
and then possibly deletes it. Thus every nonleaf vertex in T appears in f(T). 
No leaf appears, because recording a leaf as a neighbor of a leaf would require 
reducing the tree to one vertex. Hence the leaves of T are the elements of S not 
in f(T). If f(T) = a, then the first leaf deleted is the least element of S not in 
a (call it x), and the neighbor of x is ay. 

We are given a € S"~* and seek all solutions to f(T) = a. We have shown 
that every such tree has x as its least leaf and has the edge xa,. Deleting x 
leaves a tree with vertex set S’ = S — {x}. Its Priifer code is a’ = (ao, ..., an_2), 
an n — 3-tuple formed from 8S’. 
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By the induction hypothesis, there exists exactly one tree 7’ having vertex 
set S’ and Priifer code a’. Since every tree with Priifer code a is formed by 
adding the edge xa; to such a tree, there is at most one solution to f(T) = a. 
Furthermore, adding xa, to JT’ does create a tree with vertex set S and Priifer 
code a, so there is at least one solution. w 


Cayley approached the problem algebraically and counted the trees by their 
vertex degrees. Priifer’s bijection also provides this information. 


2.2.4. Corollary. Given positive integers d,,...,d, summing to 2n — 2, there 
are exactly ae trees with vertex set [n] such that vertex i has degree 
d;, for each i. 


Proof: While constructing the Priifer code of a tree T, we record x each time 
we delete a neighbor of x, until we delete x itself or leave x among the last two 
vertices. Thus each vertex x appears d;(x) — 1 times in the Priifer code. 
Therefore, we count trees with these vertex degrees by counting lists of 
length n — 2 that for each i have d; — 1 copies of i. If we assign subscripts to 
the copies of each i to distinguish them, then we are permuting n — 2 distinct 
objects and there are (1 — 2)! lists. Since the copies of i are not distinguishable, 
we have counted each desired arrangement | [(d; — 1)! times, once for each way 
to order the subscripts on each type of label. (Appendix A discusses further 
aspects of this counting problem.) | 
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2.2.5.Example. Trees with fixed degrees. Consider trees with vertices 
{1, 2,3, 4, 5, 6, 7} that have degrees (3, 1, 2,1,3,1,1), respectively. We com- 
pute ce a = 30; the trees are suggested below. Only the vertices {1, 3, 5} are 
non-leaves. Deleting the leaves yields a subtree on {1, 3,5}. There are three 


such subtrees, determined by which of the three is in the middle. 
1 3 5 1 5 8 5 1 8 


To complete each tree, we add the appropriate number of leaf neighbors for 
each non-leaf to give it the desired degree. There are six ways to complete the 
first tree (pick from the remaining four vertices the two adjacent to vertex 1) 
and twelve ways to complete each of the others (pick the neighbor of vertex 3 
from the remaining four, and then pick the neighbor of the central vertex from 
the remaining three). | 


‘SPANNING TREES IN GRAPHS 


We can interpret Cayley’s Formula in another way. Since the complete 
graph with vertex set [n] has all edges that can be used in forming trees with 
vertex set [n], the number of trees with a specified vertex set of size n equals 
the number of spanning trees in a complete graph on n vertices. 
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We now consider the more general problem of computing the number of 
spanning trees in any graph G. In general, G will not have as much symmetry 
as a complete graph, so it is unreasonable to expect as simple a formula as for 
K,, but we can hope for an algorithm that provides a simple way to compute 
the answer when given a graph G. 


2.2.6. Example. Below is the kite. To count the spanning trees, observe that 
four are paths around the outside cycle in the drawing. The remaining span- 
ning trees use the diagonal edge. Since we must include an edge to each vertex 
of degree 2, we obtain four more spanning trees. The total is eight. | 


Pt oo} LL 
ZVUA SA 


In Example 2.2.6, we counted separately the trees that did or did not con- 
tain the diagonal edge. This suggests a recursive procedure to count spanning 
trees. It is clear that the spanning trees of G not containing e are simply the 
spanning trees of G — e, but how do we count the trees that contain e? The 


answer uses an elementary operation on graphs. 


1e/1 


Fs 


2.2.7. Definition. In a graph G, contraction of edge e with endpoints u, v is 
the replacement of u and v with a single vertex whose incident edges are 
the edges other than e that were incident to u or v. The resulting graph 
G -e has one less edge than G. 


u 


v 


In a drawing of G, contraction of e shrinks the edge to a single point. Con- 
tracting an edge can produce multiple edges or loops. To count spanning trees 
correctly, we must keep multiple edges (see Example 2.2.9). In other applica- 
tions of contraction, the multiple edges may be irrelevant. 

The recurrence applies for all graphs. 


2.2.8. Proposition. Let 1(G) denote the number of spanning trees of a graph 
G. Ife € E(G) is not a loop, then r(G) = 1(G —e) + t(G- e). 


Proof: The spanning trees of G that omit e are precisely the spanning trees of 
G —e. To show that G has 1t(G - e) spanning trees containing e, we show that 
contraction of e defines a bijection from the set of spanning trees of G containing 
e to the set of spanning trees of G - e. 
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When we contract e in a spanning tree that contains e, we obtain a span- 
ning tree of G - e, because the resulting subgraph of G -e is spanning and 
connected and has the right number of edges. The other edges maintain their 
identity under contraction, so no two trees are mapped to the same spanning 
tree of G-e by this operation. Also, each spanning tree of G-e arises in this way, 
since expariding the new vertex back into e yields a spanning tree of G. Since 
each spanning tree of G - e arises exactly once, the function is a bijection. ii 


2.2.9. Example. A step in the recurrence. The graphs on the right each have 
four spanning trees, so Proposition 2.2.8 implies that the kite has eight span- 
ning trees. Without the multiple edges, the computation would fail. a 


G G—e G-e 


We can save some computation time by recognizing special graphs G where 
we know 1t(G), such as the graph on the right above. 


2.2.10. Remark. If G is a connected-loopless graph with no cycle of length 
at least 3, then 1(G) is the product of the edge multiplicities. A disconnected 
graph has no spanning trees. @ 
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We cannot apply the recurrence of Proposition 2.2.8 when ¢ is a loop. For 
example, a graph consisting of one vertex and one loop has one spanning tree, 
but deleting and contracting the loop would count it twice. Since loops do not 
affect the number of spanning trees, we can delete loops as they arise. 

Counting trees recursively requires initial conditions for graphs in which 
all edges are loops. Such a graph has one spanning tree if it has only one vertex, 
and it has no spanning trees if it has more than one vertex. If a computer 
completes the computation by deleting or contracting every edge in a loopless 
graph G, then it may compute as many as 2°) terms. Even with savings from 
Remark 2.2.10, the amount of computation grows exponentially with the size 
of the graph; this is impractical. 

Another technique leads to a much faster computation. The Matrix Tree 
Theorem, implicit in the work of Kirchhoff [1847], computes r(G) using a de- 
terminant. This is much faster, because determinants of n-by-n matrices can 
be computed using fewer than n° operations. Also, Cayley’s Formula follows 
from the Matrix Tree Theorem with G = K,, (Exercise 17), but it does not follow 
easily from Proposition 2.2.8. 

Before stating the theorem, we illustrate the computation it specifies. 


2.2.11. Example. A Matrix Tree computation. Theorem 2.2.12 instructs us to 
form a matrix by putting the vertex degrees on the diagonal and subtracting 
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the adjacency matrix. We then delete a row and a column and take the deter- 
minant. When G is the kite of Example 2.2.9, the vertex degrees are 3, 3, 2, 2. 
We form the matrix on the left below and take the determinant of the matrix 
in the middle. The result is the number of spanning trees! | 


3-1-1 -1 
13 -1 -1 3 - —1 : 
1-1 2 0!/ > {7} : = 
1-10 2 -1 9 


Loops don’t affect spanning trees, so we delete them before the computa- 
tion. The proof of the theorem uses properties of determinants. 


2.2.12. Theorem. (Matrix Tree Theorem) Given a loopless graph G with vertex 
set v1,..., Un, let a;,; be the number of edges with endpoints v; and v;. Let 
Q be the matrix in which entry (i, j) is —a;,; when i # j and is d(v;) when 
i = j. If Q* is a matrix obtained by deleting row s and column t of Q, then 
t(G) = (-1)5*' det Q*. 
Proof*: We prove this only when s = ft; the general statement follows from a 
result in linear algebra (when the columns of a matrix sum to the vector 0, the 
cofactors are constant in each row—Exercise 8.6.18). 
Step 1. If D is an orientation of G, and M is the incidence matrix of D, then 
Q = MM’. With edges e1,...,m, thes entries of M are m;,; = 1 when y; is the 
tail of e;, mj; = —1 when »; is the head of ej, and m;,; = 0 otherwise. Entry 
i,j in MM is the dot product of rows i and j of M. When i + j, the product 
counts —1 for every edge of G joining the two vertices; when i = j, it counts 1 
for every incident edge and yields the degree. 


a b c d e 1 c 2 


1/-1 11 0 0 3-10 -2 
2{0 0 -1 -1 0 -t 2 -1 0 
M=s3/1o0 0 0 1 -1]? 40> )|.9 <4 2B <1 
4\1 -10 01 2 0 -1-3 


e 

Step 2. If B is an (n — 1)-by-(n — 1) submatrix of M, then det B = +1 if the 
corresponding n — 1 edges form a spanning tree of G, and.otherwise det B = 0. 
In the first case, we use induction on n to prove that det B = +1. Forn = 1, by 
convention a 0 x 0 matrix has determinant 1. For n > 1, let T be the spanning 
tree whose edges are the columns of B. Since T has at least two leaves and only 
one row is deleted, B has a row corresponding to a leaf x of T. This row has 
only one nonzero entry in B. When computing the determinant by expanding 
along this row, the only submatrix B’ with nonzero weight in the expansion 
corresponds to the spanning subtree of G — x obtained by deleting x and its 
incident edge from 7, Since B’ is an (n—2)-by-(n — 2) submatrix of the incidence 
matrix for an orientation of G — x, the induction hypothesis yields det B’ = +1. 
Since the nonzero entry in row x is +1, we obtain the same result for B. 
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If the n— 1 edges corresponding to columns of B do not form a spanning tree, 
then by Theorem 2.1.4C they contain a cycle C. We form a linear combination 
of the columns with coefficient 0 if the corresponding edge is not in C, +1 if it 
is followed forward by C, and —1 if it is followed backward by C. The result 
is total weight 0 at each vertex, so the columns are linearly dependent, which 
yields det B = 0. 

Step 3. Computation of det Q*. Let M* be the result of deleting row t of M, 
so Q* = M*(M*)'. If m < n-—1, then the determinant is 0 and there are no 
spanning subtrees, so we assume that m > n — 1. The Binet—Cauchy Formula 
(Exercise 8.6.19) computes the determinant of a product of non-square matrices 
using the determinants of square submatrices of the factors. When m > p, A 
is p-by-m, and B is m-by-p, it states that detAB = )°, det As det Bs, where 
the summ runs over all p-sets S in [m], As is the submatrix of A consisting of 
the columns indexed by S, and Bs is the submatrix of B consisting of the rows 
indexed by S. When we apply the formula to Q* = M*(M*)', the submatrix As 
is an (n — 1)-by-(n — 1) submatrix of M as discussed in Step 2, and Bs = Af. 
Hence the summation counts 1 = (+1)? for each set of n—1 edges corresponding 
to a spanning tree and 0 for all other sets of n — 1 edges. | 


Te): 


DECOMPOSITION AND GRACEFUL LABELINGS 


ut 


n 


We consider another problem about graph decomposition (Definition 
1.1.32). We can always decompose G into single edges; can we decompose 
G into copies of a larger tree T? This requires that e(T) divides e(G) and 
A(G) > A(T); is that sufficient? Even when G is e(T)-regular, this may fail 
(Exercise 20); for example, the Petersen graph does not decompose into claws. 

Haggkvist conjectured that if G is a 2m-regular graph and T is a tree with 
m edges, then E(G) decomposes into n(G) copies of T. Even the “simplest” case 
when G is a clique is still unsettled and notorious. 


2.2.13. Conjecture. (Ringel [1964]) If T is a fixed tree with m edges, then 
Kon+1 decomposes into 2m + 1 copies of T. | 


Attempts to prove Ringel’s conjecture have focused on the stronger Grace- 
ful Tree Conjecture. This implies Ringel’s conjecture and a similar statement 
about decomposing complete graphs of even order (Exercise 23). 


2.2.14. Definition. A graceful labeling of a graph G with m edges is a func- 
tion f: V(G) > {0,...,m} such that distinct vertices receive distinct num- 
bers and {| f(u) — f(v)|: uv € E(G)} = {1,..., m}. A graph is graceful if it 
has a graceful labeling. 


2.2.15. Conjecture. (Graceful Tree Conjecture—Kotzig, Ringel [1964]) Every 
tree has a graceful labeling. | 
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2.2.16. Theorem. (Rosa [1967]) Ifa tree T with m edges has a graceful labeling, 
then K2,,,; has a decomposition into 2m + 1 copies of T. 


Proof: View the vertices of K2,,; as the congruence classes modulo 2m + 1, 
arranged circularly. The difference between two congruence classes is 1 if they 
are consecutive, 2 if one class is between them, and so on up to difference m. 
We group the edges of Ka,,; by the difference between the endpoints. For 
1 < j « m, there are 2m + 1 edges with difference j. 


0 


0 4 2 8 7e 2 


From a graceful labeling of T, we define copies of T in K2,,1; the copies are 
Tp,..., Tom. The vertices of T, arek,...,k +m (mod 2m + 1), with k +i adjacent 
tok + j if and only if i is adjacent to j/in the graceful labeling of T. The copy 
To looks just like the graceful labeling-and has one edge with each difference. 
Moving to the next copy shifts each edge to another having the same difference 
by adding one to the name of each endpoint. Each difference class of edges has 
one edge in each 7,, and thus 7p, ..., Tn, decompose Kon+1. | 

Graceful labelings are known to exist for some types of trees and for some 
other families of graphs (see Gallian [1998]). It is easy to find graceful labelings 
for stars and paths. We next define a family of trees that generalizes both by 
permitting the addition of edges incident to a path. 


2.2.17. Definition. A caterpillar is a tree in which a single path (the spine) 
is incident to (or contains) every edge. 


2.2.18. Example. The vertices not on the spine of a caterpillar (the “feet”) 
are leaves. Below we show a graceful labeling of a caterpillar; in fact, every 
caterpillar is graceful (Exercise 31). The tree Y below is not a caterpillar. | 


0 14 3 10 4 8 7 


1 2131211 95 6 ¥ 


2.2.19. Theorem. A tree is a caterpillar if and only if it does not contain the 
tree Y above. 
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Proof: Let G’ denote the tree obtained from a tree G by deleting each leaf of 
G. Since all vertices that survive in G’ are non-leaves in G, G’ has a vertex of 
degree at least 3 if and only if Y appears in G. Hence G has no copy of Y if and 
only if A(G’) < 2. This is equivalent to G’ being a path, which is equivalent to 
G being a caterpillar. a 


BRANCHINGS AND EULERIAN DIGRAPHS (optional) 


Tutte extended the Matrix Tree Theorem to digraphs. His theorem reduces 
to the Matrix Tree Theorem when the digraph is symmetric (a digraph is sym- 
metric if its adjacency matrix is symmetric, and then it models a graph). There 
is a surprising connection between this theorem and Eulerian circuits. 


2.2.20. Definition. A branching or out-tree is an orientation of a tree having 
a root of indegree 0 and all other vertices of indegree 1. An in-tree is an 
out-tree with edges reversed. 


A branching with root v is a union of paths from v (Exercise 33). Each 
vertex is reached by exactly one path. The analogous result holds for in-trees; 
an in-tree is a union of paths to the root, one from each vertex. 

We state without proof Tutte’s theorem to count branchings. 


2.2.21. Theorem. (Directed Matrix Tree Theorem—Tutte [1948]) Given a loop- 
less digraph G, let Q© = D~ — A’-and Q*+ = Dt — A’, where D~ and Dt 
are the diagonal matrices of indegrees and outdegrees in G, and thei, jth- 
entry of A’ is the number of edges from v; to v;. The number of spanning 
out-trees (in-trees) of G rooted at v; is the value of each cofactor in the ith 


row of Q~ (ith column of Q*). ] 
1 3 
0 0 0 2 0 0 
o=(-1 1 0 or=(-1 1 0) 
—1 -1l 2 —1 -1 0 
Z 


2.2.22. Example. The digraph above has two spanning out-trees rooted at 1 
and two spanning in-trees rooted at 3. Every cofactor in the first row of Q™ is 
2, and every cofactor in the third column of Q* is 2. | 


Isolated vertices don’t affect Eulerian circuits. After discarding these, a 
digraph is Eulerian if and only if indegree equals outdegree at every vertex and 
the underlying graph is connected (Theorem 1.4.24). Such a digraph also is 
strongly connected, which allows us to find a spanning in-tree. We will describe 
Eulerian circuits in terms of a spanning in-tree. 
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2.2.23. Lemma. In a strong digraph, every vertex is the root of an out-tree 
(and an in-tree). 


Proof: Consider a vertex v. We iteratively add edges to grow a branching from 
v. Let S; be the set of vertices reached when i edges have been added; initialize 
= {v}. Because the digraph is strong, there is an edge leaving $; (Exercise 
1.4.10). We add one such edge to the branching and add its head to S; to obtain 
S;41. This repeats until we have reached all vertices. 
To obtain an in-tree of paths to v, reverse all edges and apply the same 
procedure; the reverse of a strong digraph is also strong. | 


The lemma constructively produces a search tree of paths from a root. The 
next section discusses search trees in more generality. 


2.2.24. Algorithm. (Eulerian circuit in directed graph) 

Input: An Eulerian digraph G without isolated vertices and a spanning in-tree 
T consisting of paths to a vertex v. 

Step 1: For each u € V(G), specify an ordering of the edges that leave u, such 
that for u ~ v the edge leaving u in T comes last. 

Step 2: Beginning at v, construct an Eulerian circuit by always exiting the 
current vertex u along the next unused edge in the ordering specified atu. 


2.2.25. Example. In the digraph bela, the bold edges form an in-tree T of 
paths to v. The edges labeled in order starting with 1 form an Eulerian circuit. 

It leaves a vertex along an edge of T only where there is no alternative. If the 
ordering at v places 1 before 10 before 13, then the algorithm traverses the 
edges in the order indicated. 2 a 


2.2.26. Theorem. Algorithm 2.2.24 always produces an Eulerian circuit. 


Proof: Using Lemma 2.2.23, we construct an in-tree T to a vertex v. We then 
apply Algorithm 2.2.24 to construct a trail. It suffices to show that the trail can 
end only at v and does so only after traversing all edges. 

When we enter a vertex u + v, the edge leaving u in T has not yet been 
used, since d+(u) = d~(u). Thus whenever we enter.u there is still a way out. 
Therefore the trail can only end at v. 

We end when we cannot continue; we are at v and have used all exiting 
edges. Since d~(v) = d*(v), we must also have used all edges entering v. Since 
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we cannot use an edge of T until it is the only remaining edge leaving its tail, 
we cannot use all edges entering v until we have finished all the other vertices, 
since T contains a path from each vertex to v. | 


2.2.27. Example. In the digraph below, every in-tree to v contains all of 
uv, yz, wx, exactly one of {zu, zv}, and exactly one of {xy, xz}. There are four in- 
trees to v. For each in-tree, we consider [](d; — 1)! = (0!)3(1!)? = 1 orderings of 
the edges leaving the vertices. Hence we can obtain one Eulerian circuit from 
each in-tree, starting along the edge e = vw fram v. The four in-trees and the 


corresponding circuits appear below. | 
M < _ In—tree has Circuit 
zu & xy (v, W,X,Z,U,X, y,Z,U) 
v y wu&exz (v, W,X, Y,Z, VU, xX, Z,u) 
zv & xy (v, W,X,Z,U, UV, xX, y, Z) 
2 p zv & xz (v, W, X,Y, Z, U,V, X, Z) 


Two Eulerian circuits are the same if the successive pairs of edges are the 
same. From each in-tree to v, Algorithm 2.2.24 generates |],,-yg)(d*(u) — D)! 
different Eulerian circuits. The last out-edge is fixed by the tree for vertices 
other than v, and since we consider only the cyclic order of the edges we may 
also choose a particular edge e to start the ordering of edges leaving v. Any 
change in the exit orderings at vertices specifies at some point different choices 
for the next edge, so the circuits are distinct. Similarly, circuits obtained from 
distinct in-trees are distinct. Hence we have generated CTTuevigy(d*(u) — 1)! 
distinct Eulerian circuits, where c is the number of in-trees to v. 

In fact, these are all the Eulerian circuits. This yields a combinatorial 
proof that the number of in-trees to each vertex of an Eulerian digraph is the 
same. The graph obtained by reversing all the edges has the same number 
of Eulerian circuits, so the number of out-trees from any vertex also has this 
value, c. Theorem 2.2.21 provides a computation of c. 


2.2.28. Theorem. (van Aardenne-Ehrenfest and de Bruijn [1951]). In an Eu- 
lerian digraph with d; = d*(v;) = d~ (v;) the number of Eulerian circuits is 
c],;(d; — 1)!, where c counts the in-trees to or out-trees from any vertex. 


Proof: We have argued that Algorithm 2.2.24 generates this many distinct 
Eulerian circuits using in-trees to vertex v (starting from v alond e). We need 
only show that this produces all Eulerian circuits. 

To find the tree and ordering that generates an Eulerian circuit C, follow 
C from e, and record the order of the edges leaving each vertex. Let T be the 
subdigraph consisting of the last edge on C leaving each vertex other than v. 
Since the last edge leaving a vertex occurs in C after all edges entering it, each 
edge in T extends to a path in T that reaches v. With n — 1 edges, T thus forms 
an in-tree to v. Furthermore, C is the circuit obtained by Algorithm 2.2.24 from 
T and the orderings of exiting edges that we recorded. | 
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EXERCISES 


2.2.1. (—) Determine which trees have Priifer codes that (a) contain only one value, (b) 
contain exactly two values, or (c) have distinct values in all positions. 


2.2.2. (—) Count the spanning trees in the graph on the left below. (Proposition 2.2.8 
provides a systematic approach, and then Remark 2.2.10 and Example 2.2.6 can be used 
to shorten the computation.) 


Xx 


2.2.3. (—) Let G be the graph on the right above. Use the Matrix Tree Theorem to find 
a matrix whose determinant is t(G). Compute t(G). 


2.2.4. (—) Let G bea simple graph with m edges. Prove that if G has a graceful labeling, 
then K2,,; decomposes into copies of G. (Hint: Follow the proof of Theorem 2.2.16.) 


books 


2.2.5. The graph on the left below was the logo of the 9th Quadrennial International 
Conference in Graph Theory, held in Kalamazoo in 2000. Count its spanning trees. 


telegram.n 


2.2.6. (!) Let G be the 3-regular graph with 4m vertices formed from m pairwise disjoint 
kites by adding m edges to link them in a ring, as shown on the right above for m = 6. 
Prove that t(G) = 2m8” 


2.2.7. (!) Use Cayley’s Formula to prove that the graph obtained from K,, by deleting an 
edge has (n — 2)n"~> spanning trees. 


2.2.8. Count the following sets of trees with vertex set [n], giving two proofs for each: 
one using the Priifer correspondence and one by direct counting arguments. 

a) trees that have 2 leaves. 

b) trees that have n — 2 leaves. 


2.2.9. Let S(m,r) denote the number of partitions of an m-element set into r nonempty 
subsets. In terms of these numbers, count the trees with vertex set {v,,..., v, } that have 
exactly k leaves. (Rényi [1959]) 


2.2.10. Compute t(K2,,,). Also compute the number of isomorphism classes of spanning 
trees of Kon. 
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2.2.11. (+) Compute 1r(K3.»,). 


2.2.12. From a graph G we define two new graphs. Let G’ be the graph obtained by 
replacing each edge of G with k copies of that edge. Let G” be the graph obtained by 
replacing each edge uv € E(G) with a u, v-path of length k through k — 1 new vertices. 
Determine 1(G’) and t(G”) in terms of 1(G) and k. 


as Z 


G G"” 


2.2.13. Consider K,,,, with bipartition X, Y, where X = {x,,...,x,} and Y = {yi,..., Yn}. 
For each spanning tree 7, we form a list f(T) of ordered pairs (written vertically). Hav- 
ing generated part of the list, let u be the least-indexed leaf in X in the remaining 
subtree, and similarly let v be the least-indexed leaf in Y. Append the pair (7) to the 
list, where a is the index of the neighbor of u and b is the index of the neighbor of v. 
Delete {u, v}. Iterate until n — 1 pairs have been generated to form f(T) (one edge re- 
mains). Part (a) shows that f is well-defined. 

a) Prove that every spanning tree of K,,,,, has a leaf in each partite set. 

b) Prove that f is a bijection from the set of spanning trees of K,,,, to ([n] x [n])"~} 
Thus K,,,, has n?"~? spanning trees. (Rényi [1966], Kelmans [1992], Pritikin [1995]) 
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2.2.14, (+) Let f(r,s) be the number of trees with vertex [n] that have partite sets of 
sizes r and s (with r +s =n). Prove that f(r,s) = es a oe if r # s. What is the 
formula when r = s? (Hint: First show that the Prifer sequence for such a tree will 
have r — 1 of its terms from the partite set of size s and s — 1 of its terms from the partite 
set of size r.) (Scoins [1962], Glicksman [1963]) 


2.2.15. Let G,, be the graph with 2n vertices and 3n — 2 edges pictured below, for n > 1. 
Prove for n > 2 that r(G,,) = 4t(G,_-1) — tT(G,,-2). (Kelmans [1967a]) 


2.2.16. For n > 1, let a, be the number of spaining trees in the graph formed from 
P, by adding one vertex adjacent to all of V(P,). For example, a; = 1, a2 = 3, and 


a3 = 8. Prove forn > 1 thata, = a,-1+1+ ee a;. Use this to prove for n > 2 that 


Gy = 38an—1 — An_2. (Comment: It is also possible to argue directly that a, = 3a,_1 — an—2.) 


2.2.17. Use the Matrix Tree Theorem to prove Cayley’s Formula. 
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2.2.18. Use the Matrix Tree Theorem to compute 1(K,,). (Lovadsz [1979, p223]—see 
Kelmans [1965] for a generalization) 


2.2.19. (+) Prove combinatorially that the number t, of trees with vertex set [n] satisfies 


the recurrence t, = thes ae tity ;- (Comment: Since 4, = n” n-2 , this proves the 


identity n”’-? = a (po) n - k)"-*-2.) (Dziobek [1917]; see ica [1979, p219]) 


2.2.20. (!) Prove that a d-regular simple graph G has a decomposition into copies of K1,4 
if and only if it is bipartite. 


2.2.21. (+) Prove that Kom—1,2 decomposes into m spanning paths. 


2.2.22. Let G be an n-vertex simple graph that decomposes into k spanning trees. Given 
also that A(G) = 6(G) + 1, determine the degree sequence of G in terms of n and k. 


2.2.23. (!) Prove that if the Graceful Tree Conjecture is true and 7 is a tree with m 
edges, then K2,, decomposes into 2m — 1 copies of T. (Hint: Apply the cyclically invariant 
decomposition of K2,,-1 for trees with m — 1 edges from the proof of Theorem 2.2.16.) 


2.2.24. Of the n"~? trees with vertex set {0,..., — 1}, how many are gracefully labeled 
by their vertex names? 


2.2.25. (!) Prove that if a graph G is graceful and Eulerian, then e(G) is congruent to 0 
or 3 mod 4. (Hint: Sum the absolute edge differences (mod 2) in two different ways.) 


2.2.26. (+) Prove that C, is graceful if and only if 4 divides n or n + 1. (Frucht [1979]) 


2.2.27. (+) Let G be the graph consisting af k 4-cycles with one common vertex. Prove 
that G is graceful. (Hint: Put 0 at the vertex of degree 2k.) 


2.2.28. Let d;,...,d, be positive integers. Prove directly that there exists a caterpillar 
with vertex degrees d,,...,d, if and only if d; = 2n — 2. 


2.2.29. Prove that every tree can be turned into a caterpillar with the same degree se- 
quence using 2-switches (Definition 1.3.32) such that each intermediate graph is a tree. 


2.2.30. A bipartite graph is drawn on a channel if the vertices of one_partite set are 
placed on one line in the plane (in some order) and the-vertices of the other partite-set 
are placed on a line parallel to it and the edges are drawn as straight-line segments» 
between them. Prove that a connected graph G can be drawn on a channel without edge 
crossings if and only if G is a caterpillar. 


2.2.31. (!) An up/down labeling is a graceful labeling for which there exists a critical 
value a such that every edge joins vertices with labels above and below a. Prove that 
every caterpillar has an up/down labeling. Prove that the 7-vertex tree that is moe a 
caterpillar has no up/down-labeling. 


2.2.32. (+) Prove that the number of isomorphism classes of n-vertex caterpillars is 
2-4 4. gln/2l-2 if, > 3. (Harary—Schwenk [1973], Kimble—Schwenk [1981]) 


2.2.33. (!) Let T be an orientation of a tree such that the heads of the edges are all 
distinct; the one vertex that is not a head is the root. Prove that T is a union of paths 
from the root. Prove that for each vertex of T, exactly one path reaches it from the root. 


2.2.34. («) Use Theorem 2.2.26 to prove that the algorithm below generates a binary 
deBruijn cycle of length 2” (the cycle in Application 1.4.25 arises in-this way). 

Start with n 0’s. Subsequently, append a 1 if doing so does not repeat 

a previous string of length n, otherwise append a 0. 
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2.2.35. (*) Tarry’s Algorithm (as presented by D.G. Hoffman). Consider a castle with 
finitely many rooms and corridors. Each corridor has two ends; each end has a door 
into a room. Each room has door(s), each of which leads to a corridor. Each room can 
be reached from any other by traversing corridors and rooms. Initially, no doors have 
marks. A robot started in some room will explore the castle using the following rules. 

1) After entering a corridor, traverse it and enter the room at the other end. 

2) Upon entering a room with all doors unmarked, mark I on the door of entry. 

3) In a room with an unmarked door, mark O on such a door and use it. 

4) In a room with all doors marked, exit via a door not marked O if one exists. 

5) In a room with all doors marked O, stop. 
Prove that the robot traverses each corridor exactly twice, once in each direction, and 
then stops. (Hint: Prove that this holds for the corridors at every reached vertex, and 
prove that every vertex is reached. Comment: All decisions are completely local; the 
robot sees nothing other than the current room or corridor. Tarry’s Algorithm [1895] 
and others are described by K6nig [1936, p35-56] and by Fleischner [1983, 1991].) 


2.3. Optimization and Trees 


“The best spanning tree” may have various meanings. A weighted graph 
is a graph with numerical labels on the edges. When building links to connect 
locations, the costs of potential links yield a weighted graph. The minimum 
cost of connecting the system is the minimum total weight of its spanning trees. 

Alternatively, the weights may represent distances. In these case we define 
the length of a path to be the sum of its edge weights. We may seek a spanning 
tree with small distances. When discussing weighted graphs, we consider 
only nonnegative edge weights. 

We also study a problem about finding good trees to eneode messages. 


MINIMUM SPANNING TREE 


In a connected weighted graph of possible communication links, all) span- 
ning trees have n — 1 edges; we seek one that minimizes or maximizes the 
sum of the edge weights. For these problems, the most naive heuristic quickly 
produces an optimal solution. 


2.3.1. Algorithm. (Kruskal’s Algorithm - for minimum spanning trees.) 
Input: A weighted connected graph. 

Idea: Maintain an acyclic spanning subgraph H, enlarging it by edges with 
low weight to form a spanning tree. Consider edges in nondecreasing order of 
weight, breaking ties arbitrarily. 

Initialization: Set E(H) = ©. 

Iteration: If the next cheapest edge joins two components of H, then include 
it; otherwise, discard it. Terminate when H is connected. | 
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Theorem 2.3.3 verifies that Kruskal’s Algorithm produces an optimal tree. 
Unsophisticated locally optimal heuristics are called greedy algorithms. 
They usually don’t guarantee optimal solutions, but this one does. 

In a computer, the weights appear in a matrix, with huge weight on “un- 
available” edges. Edges of equal weight may be examined in any order; the 
resulting trees have the same cost. Kruskal’s Algorithm begins with a forest of 
n isolated vertices. Each selected edge combines two components. 


2.3.2. Example. Choices in Kruskal’s Algorithm depend only on the order of 
the weights, not on their values. In the graph below we have used positive 
integers as weights to emphasize the order of examination of edges. The four 
cheapest edges are selected, but then we cannot take the edges of weight 5 or 
6. We can take the edge of weight 7, but then not those of weight 8 or 9. | 


12 


2.3.3. Theorem. (Kruskal [1956]). In a connected weighted graph G, Kruskal’s 
Algorithm constructs a minimum-weight spanning tree. 


Proof: We show first that the algorithm produces a tree. It never chooses an 
edge that completes a cycle. If the final graph has more than one component, 
then we considered no edge joining two of them, because such an edge would 
be accepted. Since G is connected, some such edge exists and we considered it. 
Thus the final graph is connected and acyclic, which makes it a tree. 

Let T be the resulting tree, and let T* be a spanning tree of minimum 
weight. It 7 = T*, we are done. If T + 7%, let e be the first edge chosen for T 
that is not in 7*. Adding e to T* creates one cycle C. Since T has no cycle, C 
has an edge e’ ¢ E(T). Consider the spanning tree T* +e —e’. 

Since 7* contains e’ and all the edges of T chosen before e, both e’ and 
e are available when the algorithm chooses e, and hence w(e) < w(e’). Thus 
T* +e —e’ is a spanning tree with weight at most 7* that agrees with T for a 
longer initial list of edges than 7* does. 

Repeating this argument eventually yields a minimum-weight spanning 
tree that agrees completely with 7. Phrased extremally, we have proved that 
the minimum spanning tree agreeing with T the longest is T itself. | 


2.3.4.* Remark. To implement Kruskal’s Algorithm, we first sort the m edge 
weights. We then maintain for each vertex the label of the component contain- 
ing it, accepting the next cheapest edge if its endpoints have different labels. We 
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merge the two components by changing the label of each vertex in the smaller 
component to the label of the larger. Since the size of the component at least 
doubles when a label changes, each label changes at most Ign times, and the 
total number of changes is at most n lgn (we use lg for the base 2 logarithm). 

With this labeling method, the running time for large graphs depends on 
the time to sort m numbers. With this cost included, other algorithms may be 
faster than Kruskal’s Algorithm. In Prim’s Algorithm (Exercise 10, due also to 
Jarnik), a spanning tree is grown from a single vertex by iteratively adding the 
cheapest edge that incorporates a new vertex. Prim’s and Kruskal’s Algorithms 
have similar running times when edges are pre-sorted by weight. 

Both Bortvka [1926] and Jarnik [1930] posed and solved the minimum 
spanning tree problem. Boruvka’s algorithm picks the next edge by consider- 
ing the cheapest edge leaving each component of the current forest. Modern 
improvements use clever data structures to merge components quickly. Fast 
versions appear in Tarjan [1984] for when the edges are pre-sorted and in 
Gabow-—Galil—Spencer—Tarjan [1986] for when they are not. Thorough discus- 
sion and further references appear in Ahuja—Magnanti—Orlin [1993, Chapter 
13]. More recent developments appear in Karger—Klein—Tarjan [1995]. ia 


SHORTEST PATHS 


1e/math_books 


How can we find the shortest route from one location to another? How can 
we find the shortest routes from our home to every place in town? This requires 
finding shortest paths from one vertex to all other vertices in a weighted graph. 
Together, these paths form a spanning tree. 

Dijkstra’s Algorithm (Dijkstra [1959] and Whiting—Hillier [1960]) solves 
this problem quickly, using the observation that the u, v-portion of a shortest 
u, z-path must be a shortest u, v-path. It finds optimal routes from u to other 
vertices z in increasing order of d(u,z). The distance d(u, z) in a weighted 
graph is the minimum sum of the weights on the edges in a u, z-path (we con- 
sider only nonnegative weights). 


2.3.5. Algorithm. (Dijkstra’s Algorithm—distances from one vertex.) 
Input: A graph (or digraph) with nonnegative edge weights and a starting 
vertex u. The weight of edge xy is w(xy); let w(xy) = 00 if xy is not an edge. 
Idea: Maintain the set S of vertices to which a shortest path from u is known, 
enlarging S to include all vertices. To do this, maintain a tentative distance 
t(z) from u to each z ¢ S, being the length of the shortest u, z-path yet found. 
Initialization: Set S = {u}; t(u) = 0; t(z) = w(uz) forz Fu. 
Iteration: Select a vertex v outside S such that t(v) = min,¢s t(z). Add v to S. 
Explore edges from v to update tentative distances: for each edge vz with z ¢ S, 
update t(z) to min{t(z), t(v) + w(vz)}. 

The iteration continues until § = V(G) or until t(z) = oo for every z ¢ S. 
At the end, set d(u, v) = t(v) for all v. , | 
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2.3.6. Example. In the weighted graph below, shortest paths from u are found 
to the other vertices in the order a, b, c,d, e, with distances 1,3,5,6,8, respec- 
tively. To reconstruct the paths, we only need the edge on which each shortest 
path arrives at its destination, because the earlier portion of a shortest u, z-path 
that reaches z on the edge vz is a shortest u, v-path. 

The algorithm can maintain this information by recording the identity of 
the “selected vertex” whenever the tentative distance to z is updated. When z 
is selected, the vertex that was recorded when t(z) was last updated is the pre- 
decessor of z on the u, z-path of length d(u, z). In this example, the final edges 
on the paths to a, b,c,d,e generated by the algorithm are ua, ub, ac, ad, de, 
respectively, and these are the edges of the spanning tree generated from vu. @ 


With the phrasing given in Algorithm 2.3.5, Dijkstra’s Algorithm works 
also for digraphs, generating an out-tree rooted at u if every vertex is reachable 
from u. The proof works for graphs and for digraphs. The technique of proving 
a stronger statement in order to make an inductive proof work is called “loading 
the induction hypothesis”. 5 


um 


I 
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2.3.7. Theorem. Given a (di)graph Gand a vertex u € V(G), Dijkstra’s Algo- 
rithm computes d(u, z) for every z € V(G). 


Proof: We prove the stronger statement that at each iteration, 

1) for z € S, t(z) = d(u, z), and 

2) for z ¢ S, t(z) is the least length of a u, z-path reaching z directly from S. 
We use induction on k = |S|. Basis step: k = 1. From the initialization, S = {u}, 
d(u,u) = t(u) = 0, and the least length of a u, z-path reaching z directly from S$ 
is t(z) = w(u, z), which is infinite when wz is not an edge. 

Induction step: Suppose that when |S| = k, (1) and (2) are true. Let v be 
a vertex among z ¢ S such that t(z) is smallest. The algorithm now chooses v; 
let S’ = SU {v}. We first argue that d(u, v) = t(v). A shortest u, v-path must 
exit S before reaching v. The induction hypothesis states that the length of the 
shortest path going directly to v from S is t(v). The induction hypothesis and 
choice of v also guarantee that a path visiting any vertex outside S and later 
reaching v has length at least t(v). Hence d(u, v) = t(v), and (1) holds for S$’. 

To prove (2) for S’, let z be a vertex outside S other than v. By the hypothe- 
sis, the shortest u, z-path reaching z directly from S has length t(z) (oo if there 
is no such path). When we add v to S, we must also consider paths reaching z 
from v. Since we have now computed d(u, v) = t(v), the shortest such path has 
length t(v) + w(vz), and we compare this with the previous value of t(z) to find 
the shortest path reaching z directly from S’. 
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We have verified that (1) and (2) hold for the new set S’ of size k + 1; this 
completes the induction step. | 


Call 


The algorithm maintains the condition that d(u,x) < t(z) for allx € § 
and z ¢ S; hence it selects vertices in nondecreasing order of distance from u. 
It computes d(u, v) = co when v is unreachable from u. The special case for 
unweighted graphs is Breadth-First Search from u. Here both the algorithm 
and the proof (Exercise 17) have simpler descriptions. 


2.3.8. Algorithm. (Breadth-First Search—BFS) 

Input: An unweighted graph (or digraph) and a start vertex u. 

Idea: Maintain a set R of vertices that have been reached but not searched 
and a set S of vertices that have been searched. The set R is maintained as a 
First-In First-Out list (queue), so the first vertices found are the first vertices 
explored. 2 
Initialization: R = {u}, S = @, d(u, u)-= 0. 

Iteration: As long as R 4 @, we search from the first vertex v of R. The 
neighbors of v not in § U R are added to the back of R and assigned distance 
d(u, v) + 1, and then v is removed from the front of R and placed in S. a 
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The largest distance from a vertex u to another vertex is the eccentricity 
€(u). Hence we can compute the diameter of a graph by running Breadth-First 
Search from each vertex. 

Like Dijkstra’s Algorithm, BFS from u yields a tree T in which for each 
vertex v, the u, v-path is a shortest u, v-path. Thus the graph has no additional 
edges joining vertices of a u, v-path in T. 

Dijkstra’s Algorithm figures prominently in the solution of another well- 
known optimization problem. 


2.3.9. Application. A mail carrier must traverse all edges in a road network, 
starting and ending at the Post Office. The edges have nonnegative weights 
representing distance or time. We.seek a closed walk of minimum total length 
that uses all the edges. This is the Chinese Postman Problem, named in 
honor of the Chinese mathematician Guan Meigu [1962], who proposed it. 

If every vertex is even, then the graph is Eulerian and the answer is the 
sum of the edge weights. Otherwise, we must repeat edges. Every traversal 
is an Eulerian circuit of a graph obtained by duplicating edges. Finding the 
shortest traversal is equivalent to finding the minimum total weight of edges 
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whose duplication will make all vertex degrees even. We say “duplication” be- 
cause we need not use an edge more than twice. If we use an edge three or more 
times in making all vertices even, then deleting two of those copies will leave 
all vertices even. There may be many ways to choose the duplicated edges. Um 


2.3.10. Example. In the example below, the eight outer vertices have odd de- 
gree. If we match them around the outside to make the degrees even, the extra 
costis4+4+4+4=160rl+7+7+1= 16. Wecan do better by using all 
the vertical edges, which total only 10. | 


Adding an edge from an odd vertex'to an even vertex makes the even vertex 
odd. We must continue adding edges until we complete a trail to an odd vertex. 
The duplicated edges must consist of 3 ‘a collection of trails that pair the odd 
vertices. We may restrict our attention to paths pairing up the odd vertices 
(Exercise 24), but the paths may need to intersect. 

Edmonds and Johnson [1973] described a way to solve the Chinese Post- 
man Problem. If there are only two odd vertices, then we can use Dijkstra’s 
Algorithm to find the shortest path between them and solve the problem. If 
there are 2k odd vertices, then we still can use Dijkstra’s Algorithm to find the 
shortest paths connecting each pair of odd vertices; these are candidates to use 
in the solution. We use these lengths as weights on the edges of Kz, and then 
our problem is to find the minimum total weight of k edges that pair up these 
2k vertices. This is a weighted version of the maximum matching problem 
discussed in Section 3.3. An exposition appears in Gibbons [1985, p163-165]. 


TREES IN COMPUTER SCIENCE (optional) 
Most applications of trees in computer science use rooted trees. 


2.3.11. Definition. A rooted tree is a tree with one vertex r chosen as root. 
For each vertex v, let P(v) be the unique v, r-path. The parent of v is its 
neighbor on P(v); its children are its other neighbors. Its ancestors are 
the vertices of P(v) — v. Its descendants are the vertices u such that P(u) 
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contains v. The leaves are the vertices with no children. A rooted plane 
tree or planted tree is a rooted tree with a left-to-right ordering specified 
for the children of each vertex. 


After a BFS from u, we view the resulting tree T as rooted at u. 


2.3.12. Definition. A binary tree is a rooted plane tree where each vertex 
has at most two children, and each child of a vertex is designated as its left 
child or right child. The subtrees rooted at the children of the root are 
the left subtree and the right subtree of the tree. A k-ary tree allows 
each vertex up to k children. 


In many applications of binary trees, all non-leaves have exactly two chil- 
dren (Exercise 26). Binary trees permit storage of data for quick access. We 
store each item at a leaf and access it-by following the path from the root. We 
encode the path by recording 0 when we move to a left child and 1 when we 
move to a right child. The search time is the length of this code word for the 
leaf. Given access probabilities among n items, we want to place them at the 
leaves of a rooted binary tree to minimize the expected search time. 

Similarly, given large computer files and limited storage, we want to encode 
characters as binary lists to minimize total length. Dividing the frequencies 
by the total length of the file yields probabilities. This encoding problem then 
reduces to the problem above. 

The length of code words may vary; we need a way to recognize the end 
of the current word. If no code word is an initial portion of another, then the 
current word ends as soon as the bits since the end of the previous word form a 
code word. Under this prefix-free condition, the binary code words correspond 
to the leaves of a binary tree using the left/right encoding described above. The 
expected length of a message is }— p;l;, where the ith item has probability p; 
and its code has length /;. Constructing the optimal code is surprisingly easy. 


2.3.13. Algorithm. (Huffman’s Algorithm [1952]—prefix-free coding) 

Input: Weights (frequencies or probabilities) p1,..., Dn. 

Output: Prefix-free code (equivalently, a binary tree). 

Idea: Infrequent itemis should have longer codes; put infrequent items deeper 
by combining them into parent nodes. 

Initial case: When n = 2, the optimal length is one, with 0 and 1 being the 
codes assigned to the two items (the tree has a root and two leaves; n = 1 can 
also be used as the initial case). 


102 Chapter 2: Trees and Distance 


Recursion: When n > 2, replace the two least likely items p, p’ with a sin- 
gle item q of weight p+ p’. Treat the smaller set as a problem with n — 1 
items. After solving it, give children with weights p, p’ to the resulting leaf 
with weight g. Equivalently, replace the code computed for the combined item 
with its extensions by 1 and 0, assigned to the items that were replaced. | 


2.3.14. Example. Huffman coding. Consider eight items with frequencies 
5,1, 1,7, 8, 2,3,6. Algorithm 2.3.13 combines items according to the tree on 
the left below, working from the bottom up. First the two items of weight 1 
combine to form one of weight 2. Now this and the original item of weight 2 
are the least likely and combine to form an item of weight 4. The 3 and 4 now 
combine, after which the least likely elements are the original items of weights 
5 and 6. The remaining combinations in order are 5+ 6 = 11, 7+ 7 = 14, 
8+ 11= 19, and 14+ 19 = 38. 

From the drawing of this tree on the right, we obtain code words. In their 
original order, the items have code words 100, 00000, 00001, 01, 11, 0001, 001, 
and 101. The expected length is )> p,;l; = 90/33. This is less than 3, which 
would be the expected length of a code using the eight words of length 3. | 


8:11 
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2.3.15. Theorem. Given a probability distribution {p;} on n items, Huffman’s 
Algorithm produces the prefix-free code with minimum expected length. 


Proof: We use induction on n. Basis step: n = 2. We must send a bit to send a 
message, and the algorithm encodes each item as a single bit, so the optimum 
is expected length 1. 

Induction step: n > 2. Suppose that the algorithm computes the optimal 
code when given a distribution for n — 1 items. Every code assigns items to 
the leaves of a binary tree. Given a fixed tree with n leaves, we minimize the 
expected length by greedily assigning the messages with probabilities p; > 

- > p, to leaves in increasing order of depth. Thus every optimal code has 
least likely messages assigned to leaves of greatest depth. Since every leaf at 
maximum depth has another leaf as its sibling and permuting the items at a 
given depth does not change the expected length, we may assume that two least 
likely messages appear as siblings at greatest depth. 

Let T be an optimal tree for pi,..., p,, with the least likely items p, and 
Pn-1 located as sibling leaves at greatest depth. Let T’ be the tree obtained from 
T by deleting these leaves, and let q1,...,q,-1 be the probability distribution 
obtained by replacing {p,_1, pn} by gn-1 = Pn-1 + Pn. The tree T’ yields a code 
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for {q;}. The expected length for T is the expected length for T’ plus q,_1, since if 
k is the depth of the leaf assigned qg,-1, we lose kg,_1 and gain (k + 1)(pn-1+ Pn) 
in moving from T" to T. 

This holds for each choice of T’, so it is best to use the tree T’ that is optimal 
for {g;}. By the induction hypothesis, the optimal choice for T’ is obtained by ap- 
plying Huffman’s Algorithm to {q;}. Since the replacement of {Pnz_-1, Pn} by Gn—1 
is the first step of Huffman’s Algorithm for {p;}, we conclude that Huffman’s 


Algorithm generates the optimal tree T for {p;}. | 
T <k—- ’ 
Pj Qn-1 qj 
«<-k+1—- 
Pn Pn-1 


Huffman’s Algorithm computes an optimal prefix-free code, and its ex- 
pected length is close to the optimum over all types of binary codes. Shannon 
[1948] proved that for every code with binary digits, the expected length is at 
least the entropy of the discrete probability distribution {p;}, defined to be 
— > pi lg p; (Exercise 31). When each _p; is a power of 1/2, the Huffman code 
meets this bound exactly (Exercise 30), 
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EXERCISES 


2.3.1. (—) Assign integer weights to the edges of K,. Prove that the total weight on 
every cycle is even if and only if the total weight on every triangle is even. 


2.3.2. (—) Prove or disprove: If T is a minimum-weight spanning tree of a weighted 
graph G, then the u, v-path in T is a minimum-weight u, v-path in G. 


2.3.3. (—) There are five cities in a network. The cost of building a road directly between 
i and j is the entry a;,; in the matrix below. An infinite entry indicates that there is a 
mountain in the way and the road cannot be built. Determine the least cost of making 
all the.cities reachable from each other. 


0 3 5 11 9 
3 0 3 9 8 
5 38 0 oc 10 
11 9 wo 0 7 
9 8 10 7 O 


2.3.4. (—) In the graph below, assign weights (1, 1, 2, 2, 3, 3, 4, 4) to the edges in two 
ways: one way so that the minimum-weight spanning tree is unique, and another way 
so that the minimum-weight spanning tree is not unique. 


2.3.5. (—) There are five cities in a network. The travel time for traveling directly from 
i to j is the entry a; ; in the matrix below. The matrix is not symmetric (use directed 
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graphs), and a;,; = oo indicates that there is no direct route. Determine the least travel 
time and quickest route from i to j for each pair i, j. 

0 10 20 o 17 

7 O 5 22 38 

14 13 O 15 27 

380 co 17 O 10 

co 15 12 8 O 


2.3.6. (!) Assign integer weights to the edges of K,. Prove that on every cycle the total 
weight is even if and only if the subgraph consisting of the edges with odd weight is 
a spanning complete bipartite subgraph. (Hint: Show that every component of the 
subgraph consisting of the edges with even weight is a complete graph.) 


2.3.7. Let G be a weighted connected graph with distinct edge weights. Without using 
Kruskal’s Algorithm, prove that G has only one minimum-weight spanning tree. (Hint: 
Use Exercise 2.1.34.) 


2.3.8. Let G be a weighted connected graph. Prove that no matter how ties are broken 
in choosing the next edge for Kruskal’s Algorithm, the list of weights of a minimum 
spanning tree (in nondecreasing order) is unique. 


2.3.9. Let F be a spanning forest of a connected weighted graph G. Among all edges 
of G having endpoints in different components of F, let e be one of minimum weight. 
Prove that among all the spanning trees of G that contain F, there is one of minimum 
weight that contains e. Use this to give another proof that Kruskal’s Algorithm works. 


2.3.10. (!) Prim’s Algorithm grows a spanning tree from a given vertex of a connected 
weighted graph G, iteratively adding the cheapest edge from a vertex already/reached 
to a vertex not yet reached, finishing when all the vertices of G have been reached. 
(Ties are broken arbitrarily.) Prove that Prim’s Algorithm produces a minimum-weight 
spanning tree of G. (Jarnik [1930], Prim [1957], Dijkstra [1959], independently). 


2.3.11. For a spanning tree T in a weighted graph, let m(T) denote the maximum among 
the weights of the edges in T. Let x denote the minimum of m(T) over all spanning trees 
of a weighted graph G. Prove that if T is a spanning tree in G with minimum total 
weight, then m(T) = x (in other words, T also minimizes the maximum weight). Con- 
struct an example to show that the converse is false. (Comment: A tree that minimizes 
the maximum weight is called a bottleneck or minimax spanning tree.) 


2.3.12. In a weighted complete graph, iteratively select the edge of least weight such 
that the edges selected so far form a disjoint union of paths. After n — 1 steps, the result 
is aspanning path. Prove that this algorithm always gives a minimum-weight spanning 
path, or give an infinite family of counterexamples where it fails. 


2.3.13. (!) Let T be a minimum-weight spanning tree in G, and let T’ be another span- 
ning tree in G. Prove that T’ can be transformed into T by a list of steps that exchange 
one edge of 7’ for one edge of 7, such that the edge set is always a spanning tree and 
the total weight never increases. 


2.3.14. (!) Let C be a cycle in a connected weighted graph. Let e be an edge of maximum 
weight on C. Prove that there is a minimum spanning tree not containing e. Use this 
to prove that iteratively deleting a heaviest non-cut-edge until the remaining graph is 
acyclic produces a minimum-weight spanning tree. 
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2.3.15. Let T be a minimum-weight spanning tree in a weighted connected graph G. 
Prove that T omits some heaviest edge from every cycle in G. 


2.3.16. Four people must cross a canyon at night on a fragile bridge. At most two people 
can be on the bridge at once. Crossing requires carrying a flashlight, and there is only 
one flashlight (which can cross only by being carried). Alone, the four people cross in 
10, 5, 2, 1 minutes, respectively. When two cross together, they move at the speed of the 
slower person. In 18 minutes, a flash flood coming down the canyon will wash away the 
bridge. Can the four people get across in time? Prove your answer without using graph 
theory and describe how the answer can be found using graph theory. 


2.3.17. Given a starting vertex u in an unweighted graph or digraph G, prove directly 
(without Dijkstra’s Algorithm) that Algorithm 2.3.8 computes d(u, z) for all z € V(G). 


2.3.18. Explain how to use Breadth-First Search to compute the girth of a graph. 


2.3.19. (+) Prove that the following algorithm correctly finds the diameter of a tree. 
First, run BFS from an arbitrary vertex w to find a vertex u at maximum distance from 
w. Next, run BFS from u to reach a vertex v at maximum distance from u. Report 
diam T = d(u, v). (Cormen—Leiserson—Rivest [1990, p476]) 


2.3.20. Minimum diameter spanning tree. An MDST is a spanning tree where the max- 
imum length of a path is as small as possible. Intuition suggests that running Dijkstra’s 
Algorithm from a vertex of minimum sii aed (a center) will produce an MDST, but 
this may fail. 

a) Construct a 5-vertex example of an. unweighted graph (edge weights all equal 
1) such that Dijkstra’s Algorithm can be run from some vertex of minimum eccentricity 
and produce a spanning tree that does not have minimum diameter. 

b) Construct a 4-vertex example of a weighted graph such that Dijkstra’s algorithm 
cannot produce an MDST when run from apy vertex. 
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2.3.21. Develop a fast algorithm to test whether a graph is bipartite. The graph is given 
by its adjacency matrix or by lists of vertices and their neighbors. The algorithm should 
not need to consider an edge more than twice. 


2.3.22. (—)-Solve the Chinese Postman Problem in the k-dimensional cube Q, under the 
condition that every edge has weight 1. 


2.3.23. Every morning the Lazy Postman takes the bus to the Post Office. From there, 
he chooses a route to reach home as quickly as possible (NOT ending at the Post Office). 
Below is a map of the streets along which he must deliver mail, giving the number of 
minutes required to walk each block whether delivering or not. P denotes the post office 
and H denotes home. What must the edges traveled more than once satisfy? How many 
times will each edge be traversed in the optimal route? 


c g Dg 
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2.3.24. (—) Explain why the optimal trails pairing up odd vertices in an optimal solution 
to the Chinese Postman Problem may be assumed to be paths. Construct a weighted 
graph with four odd vertices where the optimal solution to the Chinese Postman Problem 
requires duplicating the edges on two paths that have a common vertex. 


2.3.25. Let G be a rooted tree where every vertex has 0 or k children. Given k, for what 
values of n(G) is this possible? 


2.3.26. Find a recurrence relation to count the binary trees with n + 1 leaves (here each 
non-leaf vertex has exactly two children, and the left-to-right order of children matters). 
When n = 2, the possibilities are the two trees below. 
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2.3.27. Find a recurrence relation for the number of rooted plane trees with n vertices. 
(As in a rooted binary tree, the subtrees obtained by deleting the root of a rooted plane 
tree are distinguished by their order from left to right.) 


2.3.28. (—) Compute a code with minimum expected length for a set of ten messages 
whose relative frequencies are 1, 2, 3, 4,5, 5, 6, 7, 8,9. What is the expected length of a 
message in this optimal code? 


2.3.29. (—) The game of Scrabble has 106 tiles as listed below. This does not agree 
with English; “S” is less frequent here, for example, to improve the game. Pretend 
that these are the relative frequencies in English, and compute a prefix-free code of 
minimum expected length for transmitting messages. Give the answer by listing the 
relative frequency for each length of code word. Compute the expected length of the code 
(per text character). (Comment: ASCII coding uses five bits per letter; this code will 
beat that. Of course, ASCII suffers the handicap of including codes for punctuation.) 


ABCDEFGHIJKLMNOPQRSTUYVYWXYZ@® 
92241223291142648s2212646422d1221 «2 
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2.3.30. Consider n messages occurring with probabilities p),..., p,, such that each p; 
is a power of 1/2 (each p; > 0 and )> p; = 1). 

a) Prove that the two least likely messages have equal probability. 

b) Prove that the expected message length of the Huffman code for this distribution 


is —- pi lg pj. 


2.3.31. (+) Suppose that n messages occur with probabilities p,,..., p, and that the 
words are assigned distinct binary code words. Prove that for every code, the expected 
length of a code word with respect to this distribution is at least — =) pi lg p;. (Hint: Use 
induction on n.) (Shannon [1948]) 
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Matchings and Factors 


3.1. Matchings and Covers 


Within a set of people, some pairs are compatible as roommates; under 
what conditions can we pair them all up? Many applications of graphs involve 
such pairings. In Example 1.1.9 we considered the problem of filling jobs with 
qualified applicants. Bipartite graphs have a natural vertex partition into two 
sets, and we want to know whether the two sets can be paired using edges. In 
the roommate question, the graph need not be bipartite. 


3.1.1. Definition. A matching in a graph G is a set of ron-loop edges with 
no shared endpoints. The vertices incident to the edges of a matching M 
are saturated by M; the others are unsaturated (we say M-saturated 
and M-unsaturated). A perfect matching in a graph is a matching that 
saturates every vertex. 
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3.1.2. Example. Perfect matchings in Kn. Consider K,,, with partite sets X = 
{x1,...,X,} and Y = {y,,..., y,}. A perfect matching defines a bijection from X 
to Y. Successively finding mates for x1, x2,... yields n! perfect matchings. 
Each matching is represented by a permutation of [n], mapping i to j when 
x; is matched to yj. We can express the matchings as matrices. With X and Y 
indexing the rows and columns, we let position i, j be 1 for each edge x;y, in 
a matching M to obtain the corresponding matrix. There is one 1 in each row 


and each column. a 
X1 XX. X33 X4 0100 
x 0010 
Y 100 0 
yi yo y3 Ya 000 1 
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3.1.3. Example. Perfect matchings in complete graphs. Since it has odd order, 
Kon+1 has no perfect matching. The number f, of perfect matchings in Kg, is 
the number of ways to pair up 2n distinct people. There are 2n — 1 choices for 
the partner of v2,, and for each such choice there are f,,_; ways to complete 
the matching. Hence f, = (2n — 1)fp_1 forn > 1. With fo = 1, it follows by 
induction that f, = (2n — 1) - (2n — 3)--- (1). 

There is also a counting argument for f,,. rom an ordering of 2n people, we 
form a matching by pairing the first two, the next two, and so on. Each ordering 
thus yields one matching. Each matching is generated by 2"n! orderings, since 
change the order of the pairs or the order within a pair does not change the 
resulting matching. Thus there are f,, = (2n)!/(2"n!) perfect matchings. | 


The usual drawing of the Petersen graph shows a perfect matching and 
two 5-cycles; counting the perfect matchings takes some effort (Exercise 14). 
The inductive construction of the hypercube Q, readily yields many perfect 
matchings (Exercise 16), but counting them exactly is difficult. The graphs 
below have even order but no perfect matchings. 


AAD OX 


MAXIMUM MATCHINGS 
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A matching is a set of edges, so its size is the number of edges. We can 
seek a large matching by iteratively selecting edges whose endpoints are not 
used by the edges already selected, until no more are available. This yields a 
maximal matching but maybe not a maximum matching. 


3.1.4. Definition. A maximal matching in a graph is a matching that cannot 
be enlarged by adding an edge. A maximum matching is a matching of 
maximum size among all matchings in the graph. 


A matching M is maximal if every edge not in M is incident to an edge al- 
ready in M. Every maximum matching is a maximal matching, but the converse 
need not hold. 


3.1.5. Example. Maximal 4 maximum. The smallest graph having a maximal 
matching that is not a maximum matching is P,. If we take the middle edge, 
then we can add no other, but the two end edges form a larger matching. Below 
we show this phenomenon in P, and in Pg. a 
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In Example 3.1.5, replacing the bold edges by the solid edges yields a larger 
matching. This gives us a way to look for larger matchings. 


3.1.6. Definition. Given a matching M, an M-alternating path is a path that 
alternates between edges in M and edges not in M. An M-alternating path 
whose enpoints are unsaturated by M is an M-augmenting path. 


Given an M-augmenting path P, we can replace the edges of M in P with 
the other edges of P to obtain a new matching M’ with one more edge. Thus 
when M is a maximum matching, there is no M-augmenting path. 

In fact, we prove next that maximum matchings are characterized by the 
absence of augmenting paths. We prove this by considering two matchings 
and examining the set of edges belonging to exactly one of them. We define 
this operation for any two graphs with the same vertex set. (The operation is 
defined in general for any two sets; see Appendix A.) 


3.1.7. Definition. If G and H are graphs with vertex set V, then the sym- 
metric difference GaH is the graph with vertex set V whose edges are 
all those edges appearing in exactly one of G and H. We also use this no- 
tation for sets of edges; in particular, if M and M’ are matchings, then 
MaM' =(M — M’)U(M’—- M). 
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3.1.8. Example. In the graph below, M is the matching with five solid edges, 
M' is the one with six bold edges, and the dashed edges belong to neither M nor 
M’'. The two matchings have one common edge e; it is not in their symmetric 
difference. The edges of MAM’ form a cycle of length 6 and a path of length 3. s 


® 


3.1.9. Lemma. Every component of the symmetric difference of two matchings 
is a path or an even cycle. 


Proof: Let M and M’ be matchings, and let F = MaM’. Since M and M’ are 
matchings, every vertex has at most one incident edge from each of them. Thus 
F has at most two edges at each vertex. Since A(F) < 2, every component of F 
is a path or a cycle. Furthermore, every path or cycle in F alternates between 
edges of M — M’ and edges of M’ — M. Thus each cycle has even length, with 
an equal number of edges from M and from M’. | 


3.1.10. Theorem. (Berge [1957]) A matching M in a graph G is a maximum 
matching in G if and only if G has no M-augmenting path. 
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Proof: We prove the contrapositive of each direction; G has a matching larger 
than M if and only if G has an M-augmenting path. We have observed that an 
M-augmenting path can be used to produce a matching larger than M. 

For the converse, let M’ be a matching in G larger than M; we construct an 
M-augmenting path. Let F = MaM’. By Lemma 3.1.9, F consists of paths and 
even cycles; the cycles have the same number of edges from M and M’. Since 
| ' | > |M|, F must have a component with more edges of M’ than of M. Such 
a component can only be a path that starts and ends with an edge of M’; thus 
it is an M-augmenting path in G. | 


HALLS MATCHING CONDITION 


When we are filling jobs with applicants, there may be many more ap- 
plicants than jobs; successfully filling the jobs will not use all applicants. To 
model this problem, we consider an X, Y-bigraph (bipartite graph with biparti- 
tion X, Y—Definition 1.2.17), and we seek a matching that saturates X. 

If a matching M saturates X, then for every S C X there must be at least 
|S| vertices that have neighbors in S, because the vertices matched to S must be 
chosen from that set. We use Ng(S) or simply N(S) to denote the set of vertices 
having a neighbor in S. Thus |N(S)| > |S | is a necessary condition. 

The condition “For all S ¢ X, |N(S)| = |S|? is Hall’s Condition. Hall 
proved that this obvious necessary condition is also sufficient (TONCAS). 


3.1.11. Theorem. (Hall’s Theorem—P. Hall [1935]) An X, Y-bigraph G has a 
matching that saturates X if and only if |N(S)| > |S| for all S C X. 


Proof: Necessity. The |S| vertices matched to S must lie in N(S). 

Sufficiency. To prove that Hall’s Condition is sufficient, we prove the con- 
trapositive. If Mis a maximum matching in G and M does not saturate X, then 
we obtain a set S C X such that |N(S)| < |S|. Let u € X be a vertex unsatu- 
rated by M. Among all the vertices reachable from u by M-alternating paths in 
G, let S consist of those in X, and let T consist of those in Y (see figure below 
with M in bold). Note that u € S. 


S 
X u 


Y —_——_ 
T = N(S) 

We claim that M matches T with S — {u}. The M-alternating paths from u 
reach Y along edges not in M and return to X along edges in M. Hence every 
vertex of S — {u} is reached by an edge in M from a vertex in T. Since there is 
no M-augmenting path, every vertex of T is saturated; thus an M-alternating 
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path reaching y € 7 extends via M to a vertex of S. Hence these edges of M 
yield a bijection from T to S — {u}, and we have |T| = |S — {u}|. 

The matching between 7 and S — {u} yields T C N(S). In fact, T = N(S). 
Suppose that y € Y — T has a neighbor v € S. The edge vy cannot be in M, 
since u is unsaturated and the rest of S is matched to T by M. Thus adding vy 
to an M-alternating path reaching v yields an M-alternating path to y. This 
contradicts y ¢ 7, and hence vy cannot exist. 

With T = N(S), we have proved that |N(S)| = |7| = |S| — 1 < |S| for this 
choice of $. This completes the proof of the contrapositive. | 


One can also prove sufficiency by assuming Hall’s Condition, supposing 
that no matching saturates X, and obtaining a contradiction. As we have seen, 
lack of a matching saturating X yields a violation of Hall’s Condition. Con- 
tradicting the hypothesis usually means that the contrapositive of the desired 
implication has been proved. Thus we have stated the proof in that language. 


3.1.12. Remark. Theorem 3.1.11 implies that whenever an X, Y-bigraph has 
no matching saturating X, we can verify this by exhibiting a subset of X with 
too few neighbors. 

Note also that the statement and proof permit multiple edges. | 


Many proofs of Hall’s Theorem have been published; see Mirsky [1971, p38] 
and Jacobs [1969] for summaries. A proof by M. Hall [1948] leads to a lower 
bound on the number of matchings that saturate X, as a function of the vertex 
degrees. We consider algorithmic aspects in Section 3.2. 

When the sets of the bipartition have the same size, Hall’s Theorem is the 
Marriage Theorem, proved originally by Frobenius [1917]. The name arises 
from the setting of the compatibility relation between a set of n men and a set 
of n women. If every man is compatible with k women and every woman is 
compatible with k men, then a perfect matching must exist. Again multiple 
edges are allowed, which enlarges the scope of applications (see Theorem 3.3.9 
and Theorem 7.1.7, for example). 


3.1.13. Corollary. For k > 0, every k-regular bipartite graph has a perfect 
matching. 


Proof: Let G be a k-regular X, Y-bigraph. Counting the edges by endpoints in 
X and by endpoints in Y shows that k |X| = k |Y|, so |X| = |Y|. Hence it suffices 
to verify Hall’s Condition; a matching that saturates X will also saturate Y and 
be a perfect matching. 

Consider S C X. Let m be the number of edges from S to N(S). Since G is 
k-regular, m = k|S|. These m edges are incident to N(S),som < k|N(S)|. Hence 
k|S| < k|N(S)|, which yields |N(S)| > |S| when k > 0. Having chosen S$ C X 
arbitrarily, we have established Hall’s condition. | 


One can also use contradiction here. Assuming that G has no perfect 
matching yields a set § C X such that |N(S)| < |S|. The argument obtaining a 
contradiction amounts to a rewording of the direct proof given above. 
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MIN-MAX THEOREMS 


When a graph G does not have a perfect matching, Theorem 3.1.10 al- 
lows us to prove that M is a maximum matching by proving that G has no 
M-augmenting path. Exploring all M-alternating paths to eliminate the possi- 
bility of augmentation could take a long time. 

We faced a similar situation when proving that a graph is not bipartite. 
Instead of checking all possible bipartitions, we can exhibit an odd cycle. Here 
again, instead of exploring all M-alternating paths, we would prefer to exhibit 
an explicit structure in G that forbids a matching larger than M. 


3.1.14. Definition. A vertex cover of a graph G is a set Q C V(G) that 
contains at least one endpoint of every edge. The vertices in Q cover E(G). 


In a graph that represents a road network (with straight roads and no 
isolated vertices), we can interpret the problem of finding a minimum vertex 
cover as the problem of placing the minimum number of policemen to guard the 
entire road network. Thus “cover” means “watch” in this context. 

Since no vertex can cover two edges of a matching, the size of every vertex 
cover is at least the size of every matching. Therefore, obtaining a matching 
and a vertex cover of the same size PROVES that each is optimal. Such proofs 


exist for bipartite graphs, but not for all graphs. 


oS 


1 
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3.1.15. Example. Matchings and vertex covers. In the graph on the left be- 
low we mark a vertex cover of size 2 and show a matching of size 2 in bold. 
The vertex cover of size 2 prohibits matchings with more than 2 edges, and 
the matching of size 2 prohibits vertex covers with fewer than 2 vertices. As 
illustrated on the right, the optimal values differ by 1 for an odd cycle. The 
difference can be arbitrarily large (Exercise 3.3.10). | 


3.1.16. Theorem. (Koénig [1931], Egervary [1931]) If G is a bipartite graph, 
then the maximum size of a matching in G equals the minimum size of a 
vertex cover of G. 


Proof: Let G be an X, Y-bigraph. Since distinct vertices must be used to cover 
the edges of a matching, |Q| > |M| whenever Q is a vertex cover and M isa 
matching in G. Given a smallest vertex cover Q of G, we construct a matching 
of size |Q| to prove that equality can always be achieved. 

Partition Q by letting R = QN X and T = QPY. Let H and H’ be the 
subgraphs of G induced by R U (Y — T) and T U (X — R), respectively. We use 
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Hall’s Theorem to show that H has a matching that saturates R into Y — T 
and H’ has a matching that saturates 7. Since H and H’ are disjoint, the two 
matchings together form a matching of size |Q| in G. 

Since R UT is a vertex cover, G has no edge from Y — T to X — R. For each 
S ¢ R, we consider N;,(S), which is contained in Y — 7. If |Ny(S)| < |S|, then 
we can substitute N;,(S) for S in Q to obtain a smaller vertex cover, since Nz(S) 
covers all edges incident to S that are not covered by T. 

The minimality of Q thus yields Hall’s Condition in H, and hence H has 
a matching that saturates R. Applying the same argument to H’ yields the 
matching that saturates T. | 


T | Ny(S) 


As graph theory continues to develop, new proofs of fundamental results 
like the Kénig—Egervary Theorem appear; see Rizzo [2000]. 


1e/m 


a 
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3.1.17. Remark. A min-max relation is a theorem stating equality between 
the answers to a minimization problem and a maximization problem over a 
class of instances. The Kénig—Egervary Theorem is such a relation for vertex 
covering and matching in bipartite graphs. 

For the discussions in this text, we think of a dual pair of optimization 
problems as a maximization problem M and a minimization problem N, defined 
on the same instances (such as graphs), such that for every candidate solution 
M to M and every candidate solution N to N, the value of M is less than or 
equal to the value of N. Often the “value” is cardinality, as above where M is 
maximum matching and N is minimum vertex cover. 

When M and N are dual problems, obtaining candidate solutions M and N 
that have the same value PROVES that M and N are optimal solutions for that 
instance. We will see many pairs of dual problems in this book. A min-max 
relation states that, on some class of instances, these short proofs of optimality 
exist. These theorems are desirable because they save work! Our next objective 
is another such theorem for independent sets in' bipartite graphs. | 


INDEPENDENT SETS AND COVERS 


We now turn from matchings to independent sets. The independence 
number of a graph is the maximum size of an independent set of vertices. 
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3.1.18. Example. The independence number of a bipartite graph does not al- 
ways equal the size of a partite set. In the graph below, both partite sets have 
size 3, but we have marked an independent set of size 4. | 


No vertex covers two edges of a matching. Similarly, no edge contains two 
vertices of an independent set. This yields another dual covering problem. 


3.1.19. Definition. An edge cover of G is a set L of edges such that every 
vertex of G is incident to some edge of L. 


We say that the vertices of G are covered by the edges of L. In Example 
3.1.18, the four edges incident to the marked vertices form an edge cover; the 
remaining two vertices are covered “for free”. 

Only graphs without isolated vertices have edge covers. A perfect matching 
forms an edge cover with n(G)/2 edges. In general, we can obtain an edge cover 
by adding edges to a maximum matching. 


a 


me/ 


3.1.20. Definition. For the optimal sizes of the sets in the independence and 
covering problems we have defined, we use the notation below. 


maximum size of independent set «a(G) 


maximum size of matching a’(G) 
minimum size of vertex cover B(G) 
minimum size of edge cover B'(G) 


A graph may have many independent sets of maximum size (Cs has five of 
them), but the independence number a(G) is a single integer (a(Cs) = 2). The 
notation treats the numbers that answer these optimization problems as graph 
parameters, like the order, size, maximum degree, diameter, etc. Our use of 
a’(G) to count the edges in a maximum matching suggests a relationship with 

‘the parameter a(G) that counts the vertices in a maximum independent set. 
We explore this relationship in Section 7.1. 

We use £(G) for minimum vertex cover due to its interaction with maxi- 
mum matching. The “prime” goes on £’(G) rather than on 8(G) because B(G) 
counts a set of vertices and 8’(G) counts a set of edges. 

In this notation, the Kénig—Egervary Theorem states that a’(G) = B(G) 
for every bipartite graph G. We will prove that also a(G) = f’(G) for bipartite 
graphs without isolated vertices. Since no edge can coyer two vertices of an 
independent set, the inequality B’(G) > a(G) is immediate. (When S € V(G), 
we often use S to denote V(G) — S, the remaining vertices). 
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3.1.21. Lemma. In a graph G, S € V(G) is an independent set if and only if 5 
is a vertex cover, and hence a(G) + B(G) = n(G). 


Proof: If S is an independent set, then every edge is incident to at least one 
vertex of 5. Conversely, if S covers all the edges, then there are no edges joining 
vertices of S. Hence every maximum independent set is the complement of a 
minimum vertex cover, and a(G) + B(G) = n(G). a 


The relationship between matchings and edge coverings is more subtle. 
Nevertheless, a similar formula holds. 


3.1.22. Theorem. (Gallai [1959]) If G is a graph without isolated vertices, then 

a’(G) + B'(G) = n(G). 

Proof: From a maximum matching M, we will construct an edge cover of size 
n(G) — |M|. Since a smallest edge cover is no bigger than this cover, this will 
imply that B’(G) < n(G) —a‘(G). Also, from a minimum edge cover L, we 
will construct a matching of size n(G) — |L|. Since a largest matching is no 
smaller than this matching, this will imply that a’(G) > n(G) — B’(G). These 
two inequalities complete the proof. 

Let M be a maximum matching in G. We construct an edge cover of G by 
adding to M one edge incident to each‘unsaturated vertex. We have used one 
edge for each vertex, except that each-edge of M takes care of two vertices, so 
the total size of this edge cover is n(G)— |M|, as desired. 

Now let L be a minimum edge cover. If both endpoints of an edge e belong 
to edges in L other than e, then e ¢-L, since L — {e} is also an edge cover. 
Hence each component formed by edges of L has at most one vertex of degree 
excéeding 1 and is a star (a tree with at most one non-leaf). Let k be the number 
of these components. Since L has one edge for each non-central vertex in each 
star, we have |L| = n(G) — k. We form a matching M of size k = n(G) — |L| by 
choosing one edge from each star in L. | 


3.1.23. Example. The graph below has 13 vertices. A matching of size 4 ap- 
pears in bold, and adding the solid edges yields an edge cover of size 9. The 
dashed edges are not needed in the cover. The edge cover consists of four stars; 
from each we extract one edge (bold) to form the matching. | 


3.1.24. Corollary. (Kénig [1916]) If G is a bipartite graph with no isolated 
vertices, then a(G) = B’(G). 


Proof: By Lemma 3.1.21 and Theorem 3.1.22, a(G) + B(G) = a’(G) + B(G). 
Subtracting the Kénig—Egervary relation a'(G) = B(G) completes the proof. & 
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DOMINATING SETS (optional) 


The edges covered by one vertex in a vertex cover are the edges incident to 
it; they form a star. The vertex cover problem can be described as covering the 
edge set with the fewest stars. Sometimes we instead want to cover the vertex 
set with fewest stars. This is equivalent to our next graph parameter. 


3.1.25. Example. A company wants to establish transmission towers in a re- 
mote region. The towers are located at inhabited buildings, and each inhabited 
building must be reachable. If a transitter at x can reach y, then also one at y 
can reach x. Given the pairs that can reach each other, how many transmitters 
are needed to cover all the buildings? 

A similar problem comes from recreational mathematics: How many 
queens are needed to attack all squares on a chessboard? (Exercise 56). | 


3.1.26. Definition. In a graph G, a set S C V(G) is a dominating set if every 
vertex not in S has a neighbor in S. The domination number y (G) is the 
minimum size of a dominating set in G. 


3.1.27. Example. The graph G below-has a minimal dominating set of size 4 
a 


Berge [1962] introduced the notion of domination. Ore [1962] coined this 
terminology, and the notation y(G) appeared in an early survey (Cockayne— 
Hedetniemi [1977]). An entire book (Haynes—Hedetniemi-—Slater [1998]) is 
devoted to domination and its variations. 


3.1.28. Example. Covering the vertex set with stars may not require as many 
stars as covering the edge set. When a graph G has no isolated vertices, every 
vertex cover is a dominating set, so y(G) < B(G). The difference can be large; 
v(K,) = 1, but B(K,) =n —1. 7 


When studying domination as an extremal problem, we try to obtain bounds 
in terms of other graph parameters, such as the order and the minimum de- 
gree. A vertex of degree k dominates itself and k other vertices; thus every 
dominating set in a k-regular graph G has size at least n(G)/(k + 1). For every 
graph with minimum degree k, a greedy algorithm produces a dominating set 
not too much bigger than this. 


3.1.29. Definition. The closed neighborhood N[v] of a vertex v in a graph 
is N(v) U {v}; it is the set of vertices dominated by v. 
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3.1.30. Theorem. (Arnautov [1974], Payan [1975]) Every n-vertex graph with 
minimum degree k has a dominating set of size at most n*?4* 


Proof: (Alon [1990]) Let G be a graph with minimum degree k. Given S ¢ 
V(G), let U be the set of vertices not dominated by S. We claim that some 
vertex y outside S dominates at least |U|(k + 1)/n vertices of U. Each vertex 
in U has at least k neighbors, so }°,,., |NIv]| => |U|(k + 1). Each vertex of G 
is counted at most n times by these |U| sets, so some vertex y appears at least 
|U| (k + 1)/n times and satisfies the claim. 

We iteratively select a vertex that dominates the most of the remaining un- 
dominated vertices. We have proved that when r undominated vertices remain, 
after the next selection at most r(1— (k + 1)/n) undominated vertices remain. 


Hence after n™¢+» steps the number of undominated vertices is at most 
— k¢L yn in(k+1)/(k4+ 1) —In(k+1) _ _n_ 
n(1 —) < ne = 


The selected vertices and these remaining undominated vertices together form 
a dominating set of size at most n #24», i 


3.1.31. Remark. This bound is also proved by probabilistic methods in Theo- 
rem 8.5.10. Caro—Yuster—West |2000] showed that for large k the total domina- 
tion number satisfies a bound asymptotic to this. Alon [1990] used probabilistic 
methods to show that this bound is asymptotically sharp when k is large. 
Exact bounds remain of interest for small k. Among connected n-vertex 
graphs, 6(G) > 2 implies y(G) < 2n /5 (McCuaig—Shepherd [1989], with seven 
small exceptions), and 5(G) > 3 implies y(G) < 3n/8 (Reed [1996]). Exercise 
53 requests constructions achieving these bounds. | 


Many variations on the concept of domination are studied. In Example 
3.1.25, for example, one might want the transmitters to be able to communicate 
with each other, which requires that they induce a connected subgraph. 


3.1.32. Definition. A dominating set S in G is 
a connected dominating set if G[S] is connected, 
an independent dominating set if G[S] is independent, and 
a total dominating set if G[S] has no isolated vertex. 


. Each variation adds a constraint, so dominating sets of these types are at 
least as large as y(G). Exercises 54—60 explore these variations. Studying 
independent dominating sets amounts to studying maximal independent sets. 
This leads to a nice result about claw-free graphs. 


3.1.33. Lemma. A set of vertices in a graph is an independent dominating set 
if and only if it is a maximal independent set. 


Proof: Among independent sets, S is maximal if and only if every vertex outside 
S has a neighbor in S, which is the condition for S to be a dominating set. UH 
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3.1.34. Theorem. (Bollobas—Cockayne [1979]) Every claw-free graph has an 
independent dominating set of size y(G). 


Proof: Let S be a minimum dominating set in a claw-free graph G. Let S’ be a 
maximal independent subset of S. Let T = V(G) — N(S’). Let T’ be a maximal 
independent subset of S. 

Since T’ contains no neighbor of S’, S’ U T’ is independent. Since S’ is 
maximal in S, we have S C N(S’). Since T’ is maximal in 7, T’ dominates 7. 
Hence S’ UT’ is a dominating set. 

It remains to show that |S’ U T’| < y(G). Since S’ is meximal in S, T’ is 
independent, and G is claw-free, each vertex of S — S’ has at most one neighbor 
in T’. Since S is dominating, each vertex of T’ has at least one neighbor in 


S — S’. Hence |T’| < |S — S’|, which yields |S’ UT’| < |S| = y(G). a 
EXERCISES 


1_books 


3.1.1. (—) Find a maximum matching in eh graph below. Prove that it is a maximum 
matching by exhibiting an optimal solution to the dual problem (minimum vertex cover). 
Explain why this proves that the cai ¢ optimal. 


3.1.2. (—) Determine the minimum size of a maximal matching in the cycle C,. 


3.1.3. (—) Let S be the set of vertices saturated by a matching M in a graph G. Prove 
that some maximum matching also saturates all of 5. Must the statement be true for 
every maximum matching? 


3.1.4. (—) For each of a, a’, 8, 6’, characterize the simple graphs for which the value of 
the parameter is 1. 


3.1.5. (—) Prove that a(G) > RT for every graph G. 


3.1.6. (—) Let T be a tree with n vertices, and let k be the maximum size of an indepen- 
dent set in 7. Determine a’(7) in terms of n and k. 


3.1.7. (—) Use Corollary 3.1.24 to prove that a graph G is bipartite if and only ifa(H) = 
f(A) for every subgraph H of G with no isolated vertices. 


3.1.8. (!) Prove or disprove: Every tree has at most one perfect matching. 


3.1.9. (!) Prove that every maximal matching in a graph G has at least a'(G)/2 edges. 
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3.1.10. Let M and N be matchings in a graph G, with |M| > |N|. Prove that there exist 
matchings M’ and N’ in G such that |M’| = |M| -— 1, |N’| = |N| +1, and M’, N’ have the 
same union and intersection (as edge sets) as M, N. 


3.1.11. Let C and C’ be cycles in a graph G. Prove that CaC’ decomposes into cycles. 


3.1.12. Let C and C’ be cycles of length k in a graph with girth k. Prove that CaC’ isa 
single cycle if and only if CM C’ is a single path. (Jiang [2001]) 


3.1.13. Let M and M’ be matchings in an X, Y-bigraph G. Suppose that M saturates 
SC X and that M’ saturates T C Y. Prove that G has a matching that saturates § UT. 
For example, below we show M as bold edges and M’ as thin edges; we can saturate 
SUT by using one edge from each. 


S S 
T T 


3.1.14, Let G be the Petersen graph. In Example 7.1.9, analysis by cases is used to 
show that if M is a perfect matching in G, then G — M = Cs + Cs. Assume this. 

a) Prove that every edge of G lies in four 5-cycles, and count the 5-cycles in G. 

b) Determine the number of perfect matchings in G. 


3.1.15. a) Prove that for every perfect matching M in Q, and every coordinate i ¢€ [k], 
there are an even number of edges in M whose endpoints differ in coordinate i. 
b) Use part (a) to count the perfect matchings in Qs. 


3.1.16. For k > 2, prove that Q, has at least 2") perfect matchings. 


3.1.17. The weight of a vertex in Q; is the number of 1s in its label. Prove that for 
every perfect matching in Q,, the number of edges matching words of weight i to words 
of weight i + 1 is (5) forO <i<k—1. 


3.1.18. (!) Two people play a game on a graph G, alternately choosing distinct vertices. 
Player 1 start by choosing any vertex. Each subsequent choice must be adjacent to the 
preceding choice (of the other player). Thus together they follow a path. The last player 
able to move wins. 

Prove that the second player has a winning strategy if G has a perfect matching, 
and otherwise the first player has a winning strategy. (Hint: For the second part, the 
first player should start with a vertex omitted by some maximun: matching.) 


3.1.19. (!) Let A = (Aj,..., Am) be a collection of subsets of a set Y. A system of 
distinct representatives (SDR) for A is a set of distinct elements a;,...,a,, in Y such 
that a; € A;. Prove that A has an SDR if and only if |U;<sA;| > |S| for every S 
{1,..., m}. (Hint: Transform this to a graph problem.) 


3.1.20. The people in a club are planning their summer vacations. Trips t;,...,¢, are 
available, but trip t; has capacity n;. Each person likes some of the trips and will travel 
on at most one. In terms of which people like which trips, derive a necessary and 
sufficient condition for being able to fill all trips (to capacity) with people who like them. 


3.1.21. (!) Let G be an X, Y-bigraph such that |//(S)| > |S| whenever @ 4 S c X. Prove 
that every edge of G belongs to some matching that saturates X. 
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3.1.22. Prove that a bipartite graph G has a perfect matching if and only if |N(S)| > |S| 
for all S C V(G), and present an infinite class of examples to prove that this characteri- 
zation does not hold for all graphs. 


3.1.23. (+) Alternative proof of Hall’s Theorem. Consider a bipartite graph G with 
bipartition X, Y, satisfying |N(S)| > |S| for every S ¢ X. Use induction on |X| to prove 
that G has a matching that saturates X. (Hint: First consider the case where |N(S)| > 
|S| for every proper subset S of X. When this does not hold, consider a minimal nonempty 
T ¢ X such that |N(T)| = |T|.) (M. Hall [1948], Halmos—Vaughan [1950]) 


3.1.24. (!) A permutation matrix P is a 0,1-matrix having exactly one 1 in each row 
and column. Prove that a square matrix of nonnegative integers can be expressed as 
the sum of k permutation matrices if and only if all row sums and column sums equal k. 


3.1.25. (!) Adoubly stochastic matrix Q is a nonnegative real matrix in which every 
row and every column sums to 1. Prove that a doubly stochastic matrix Q can be ex- 
pressed Q = c) Pi +--+ +CmPm, where ci,..., Cm are nonnegative real numbers summing 
to 1 and P;,..., P, are permutation matrices. For example, 


1/2 1/3 1/6\ ,/10 0\ ,/010\ ,/001 
0 1/6 5/6)=5(0 0 1}]+ 5(0 0 1]+2[(0 1 0 
1/2 1/2 0 0 1 0 100) ®\i 0 o 


(Hint: Use induction on the number of nonzero entries in Q.) (Birkhoff [1946], von 
Neumann [1953]) < 
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3.1.26. (!) A deck of mn cards with m values and n suits consists of one card of each 
value in each suit. The cards are dealt into-an n-by-m array. 
a) Prove that there is a set of m cards, « ;one in each column, having distinct values. 
b) Use part (a) to prove that by a sequance of exchanges of cards of the same value, 
the cards can be rearranged so that each column consists of n cards of distinct suits. 


(Enchev [1997]) oO 
3.1.27. (!) Generalizing Tic-Tac-Toe. A positional game consists of a set X = x1,..., Xn 
of positions and a family W,,..., W,, of winning sets of positions (Tic-Tac-Toe has nine 


positions and eight winning sets). Two players alternately choose positions; a player 
wins by collecting a winning set. 

Suppose that each winning set has size at least a and each position appears in at 
most b winning sets (in Tic-Tac-Toe, a = 3 and b = 4). Prove that Player 2 can force a 
draw ifa > 2b. (Hint: Form an X, Y-bigraph G, where Y = {wj,..., Wm} U{wj,..., wi}, 
with edges x;w; and x; w, whenever x; € W;. How can Player 2 use a matching in G? 
Comment: This result implies that Player 2 can force a draw in d-dimensiona] Tic-Tac- 
Toe when the sides are long enough.) 


3.1.28. (!) Exhibit a perfect matching in the graph below or give a short proof that it 
has none. (Lovész—Plummer [1986, p7]) 
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3.1.29. (!) Use the Kénig—Egervary Theorem to prove that every bipartite graph G has 
a matching of size at least e(G)/A(G). Use this to conclude that every subgraph of K,, » 
with more than (k — 1)n edges has a matching of size at least k. 


3.1.30. (!) Determine the maximum number of edges in a simple bipartite graph that 
contains no matching with k edges and no star with / edges. (Isaak) 


3.1.31. Use the Kénig—Egervary Theorem to prove Hall’s Theorem. 


3.1.32. (!) In an X, Y-bigraph G, the deficiency of a set S is def(S) = |S| — |N(S)|; note 
that def(@) = 0. Prove that a’(G) = |X| — maxsc x def(S). (Hint: Form a bipartite graph 
G’ such that G’ has a matching that saturates X if and only if G has a matching of the 
desired size, and prove that G’ satisfies Hall’s Condition.) (Ore [1955]) 


3.1.33. (!) Use Exercise 3.1.32 to prove the Kénig—Egervdry Theorem. (Hint: Obtain a 
matching and a vertex cover of the same size from a set with maximum deficiency.) 


3.1.34. (!) Let G be an X, Y-bigraph with no isolated vertices, and define deficiency as 
in Exercise 3.1.32. Prove that Hall’s Condition holds for a matching saturating X if and 
only if each subset of Y has deficiency at most |Y| — |X|. 


3.1.35. Let G be an X, Y-bigraph. Prove that G is (k + 1)K2-free if and only if each 
S C X has a subset of size at most k with neighborhood N(S). (Liu—Zhou [1997]) 


3.1.36. Let G be an X, Y-bigraph having a matching that saturates X. Letting m = |X|, 
prove that G has at most (3) edges belonging to no matching of size m. Construct 
examples to show that this is best possible for every m. 


3.1.37. (+) Let G be an X, Y-bigraph having a matching that saturates X. 

a) Let S and T be subsets of X such that |N(S)| = |S| and |N(7)| = |T|. Prove that 
IN(SAT)| =|SATI. 

b) Prove that X has some vertex x auih that every edge incident to x belongs to 
some maximum matching. (Hint: Consider a minimal nonempty set S C X such that 
|N(S)| = |S|, if any exists.) 


3.1.38. (+) An island of area n has n married hunter/farmer couples. The Ministry of 
Hunting divides the island into n equal-sized hunting regions. The Ministry of Agricul- 
ture divides it into n equal-sized farming regions. The Ministry of Marriage requires 
that each couple receive two overlapping regions. By Exercise 3.1.25, this is always 
possible. Prove a stronger result: guarantee a pairing where each couple’s two regions 
share area at least 4/(n + 1)? when n is odd and 4/[n(n + 2)] when n is even. Prove also 
that no larger common area can be guaranteed; the example below achieves equality for 


n = 3, (Marcus—Ree [1959], Floyd [1990]) 
5 .25 .25 
5 .25 .25 
0 6 6 


3.1.39. Let G be a nontrivial simple graph. Prove that a(G) < n(G) — e(G)/A(G). 
Conclude that a(G)-< n(G)/2 when G also is regular. (P. Kwok) 


3.1.40. Let G be a bipartite graph. Prove that a(G) = n(G)/2 if and only if G hasa 
perfect matching. 
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3.1.41. A connected n-vertex graph has exactly one cycle if and only if it has exactly 
n edges (Exercise 2.1.30). Let C be the cycle in such a graph G. Assuming the result 
of Exercise 3.1.40 for trees, prove that a(G) > |[n(G)/2], with equality if and only if 
G — V(C) has a perfect matching. 


3.1.42. (!) An algorithm to greedily build a large independent set iteratively selects 
a vertex of minimum degree in the remaining graph and: deletes it and its neighbors. 
Prove that this algorithm produces an independent set of size at least )° yg) qari in 
a graph G. (Caro [1979], Wei [1981]) 


3.1.43. Let M be a maximal matching and L a minimal edge cover in a graph with no 

isolated vertices. Prove the statements below. (Norman—Rabin [1959], Gallai [1959]) 
a) M isa maximum matching if and only if M is contained in a minimum edge cover. 
b) L is a minimum edge cover if and only if L contains a maximum matching. 


3.1.44. (—) Let G be a simple graph in which the sum of the degrees of any k vertices 
is less than n — k. Prove that every maximal independent set in G has more than k 
vertices. (Meyer [1972]) 


3.1.45. An edge e of a graph G is a-critical if a(G — e) > a(G). Suppose that xy and 
xz are a-critical edges in G. Prove that G has an induced subgraph that is an odd cycle 
containing xy and xz. (Hint: Let Y, Z be maximum independent sets in G—x y and G—xz, 
respectively. Let H = G[YaZ]. Prove that every component of H has the same number of 
vertices from Y and from Z. Use this to prove that y and z belong to the same component 
of H.) (Berge [1970], with a difficult generalization in Markossian—Karapetian [1984]) 


3.1.46. (*—) Characterize the graphs with domination number 1. 


3.1.47. («—) Find the smallest tree where the domination number and the vextex cover 
number are not equal. 


3.1.48. (x—) Determine y(C,,) and y(P,). 
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3.1.49. (+) Let G be a graph without isolated vertices, and let S be a minimal dominating 
set in G. Prove that S is a dominating set. Conclude that y(G) < n(G)/2. (Ore [1962]) 


3.1.50. (*) Prove that y(G) < n — B’(G) < n/2 when G is an n-vertex graph without iso- 
lated vertices. For 1 < k < n/2, construct a connected n-vertex graph G with y(G) =k. 


3.1.51. (*) Let G be an n-vertex graph. 
a) Prove that [n/(1 + A(G)] = y(G) <n — A(G). 
b) Prove that (1 + diam G)/3 < y(G) <n — [diam G/3]. 


3.1.52. («) Prove that if the diameter of G is at least 3, then y(G) < 2. 


3.1.53. (*) For all k € N, construct a connected graph with 5k vertices and domination 
number 2k. Construct a single 3-regular graph G such that y(G) = 3n(G)/8. 


3.1.54. (*) Determine the domination number of the Petersen graph, and determine the 
minimum size of a total dominating set in the Petersen graph. 


3.1.55. (*) In the hypercube Q4, determine the minimum sizes of a dominating set, an 
independent dominating set, a connected dominating set, and a total dominating set. 


3.1.56. (*) Find a way to place five queens on an eight-by-eight chessboard that attack 
all other squares. Show that the five queens cannot be placed so that also they do not 
attack each other. (Comment: Thus the independent domination number of the “queen’s 
graph” exceeds its domination number; it is 7.) 
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3.1.57. (*) For all € N, construct an n-vertex tree with domination number 2 in which 
the minimum size of an independent dominating set is [n/2]. 


3.1.58. (*) Prove that a K,,-free graph G has an independent dominating set of size 
at most (r — 2)y(G) — (r — 3). (Hint: Generalize the argument of Theorem 3.1.34.) 
(Bollobaés—Cockayne [1979]) 


3.1.59. (*) In a graph G of order n, prove that the minimum size of a connected domi- 
nating set is nm minus the maximum number of leaves in a spanning tree. 


3.1.60. (*) For k < 5, every graph G with 5(G) < k has a connected dominating set of 
size at most 3n(G)/(k + 1) (Kleitman—West [1991], Griggs—-Wu [1992]). Prove that this 
is sharp using a graph formed from a cyclic arrangement of 3m pairwise-disjoint cliques 
by making each vertex adjacent to every vertex in the clique before it and the clique 
after it. Let the clique sizes be [k/2], |k/2] ,1, [k/2], |k/2],1,--> 


3.2. Algorithms and Applications 


MAXIMUM BIPARTITE MATCHING 


To find a maximum matching, we i vaieel seek augmenting paths to en- 
large the current matching. In a bipartite graph, if we don’t find an augmenting 
path, we will find a vertex cover with the same size as the current matching, 
thereby proving that the current matching has maximum size. This yields both 
an algorithm to solve the maximum matching problem and an algorithmic proof 
of the Kénig—Egervary Theorem. 

Given a matching M in an X, Y-bigraph G, we search for M-augmenting 
paths from each M-unsaturated vertex in X. We need only search from vertices 
in X, because every augmenting path has odd length and thus has ends in both 
X and Y. We will search from the unsaturated vertices in X simultaneously. 
Starting with a matching of size 0, a'(G) applications of the Augmenting Path 
Algorithm produce a maximum matching. 


3.2.1. Algorithm. (Augmenting Path Algorithm). 

Input: An X, Y-bigraph G, a matching M in G, and the set U of M-unsaturated 
vertices in X. 

Idea: Explore M-alternating paths from U, letting S C X and T C Y be the 
sets of vertices reached. Mark vertices of S that have been explored for path 
extensions. As a vertex is reached, record the vertex from which it is reached. 
Initialization: S = U and T = @. 

Iteration: If S has no unmarked vertex, stop and — T U(X — S) as a min- 
imum cover and M as a maximum matching. Otherwise, select an unmarked 
x € S. To explore x, consider each y € N(x) such that xy ¢ M. If y is unsatu- 
rated, terminate and report an M-augmenting path from U to y. Otherwise, y 
is matched to some w € X by M. In this case, include y in T (reached from x) 
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and include w in S (reached from y). After exploring all such edges incident to 
x, mark x and iterate. a 


U S 
x 


FF 


When exploring x in the iterative step, we may reach a vertex y € T that 
we have reached previously. Recording x as the previous vertex on the path 
may change which M-augmenting path we report, but it won’t change whether 
such a path exists. 


3.2.2. Theorem. Repeatedly applying the Augmenting Path Algorithm to a 
bipartite graph produces a matching and a vertex cover of equal size. 


Proof: We need only verify that the Augmenting Path Algorithm produces an 
M-augmenting path or a vertex cover of size |M|. If the algorithm produces an 
M-augmenting path, we are finished. Otherwise, it terminates by marking all 
vertices of S and claiming that R = T U(X — S) is a vertex cover of size |M|. We 
must prove that R is a vertex cover and has size |M|. 

To show that R is a vertex cover, it suffices to show that there is no edge 
joining S to Y — T. An M-alternating path from U enters X only on an edge of 
M. Hence every vertex x of S — U is matched via M to a vertex of 7, and there 
is no edge of M from S to Y — T. Also there is no such edge outside M. When 
the path reaches x € S, it can continue along any edge not in M, and exploring 
x puts all other neighbors of x into T. Since the algorithm marks all of S before 
terminating, all edges from S go to T. 

Now we study the size of R. The algorithm puts only saturated vertices in 
T; each y € T is matched via M to a vertex of S. Since U C S, also each vertex 
of X — S is saturated, and the edges of M incident to X — S cannot involve T. 
Hence they are different from the edges saturating 7, and we find that M has 
at least |7| + |X — S| edges. Since there is no matching larger than this vertex 
cover, we have |M| = |T| + |X — S| =|R|. a 


In addition to studying the correctness of algorithms, we are concerned 
about the time (number of computational steps) they use. We measure this as a 
function of the size of the input. For graph problems, we usually use the order 
n(G) and/or size e(G) to measure the input size. 


3.2.3. Definition. The running time of an algorithm is the maximum number 
of computational steps used, expressed as a function of the size of the input. 

A good algorithm is one that has polynomial running time. 
Running time is often expressed as “O( f)”, where f is a function of the 
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size of the input. Here O( f) denotes the set of functions g such that |(x)| 
is bounded by a constant multiple of | f(x)| when x is sufficiently large (that 
is, there exist c, a such that |g(x)| < c|f(x)| when |x| > a). 


Many problems we study in Chapters 1-4 have good algorithms; other no- 
tions of complexity (Appendix B) need not trouble us yet. Since we don’t know 
how long a particular operation may take on a particular computer, constant 
factors in running time have little meaning. Hence the “Big Oh” notation O(f) 
is convenient. When f is a quadratic polynomial, we typically abuse nota- 
tion by writing O(n”) instead of O(f) to describe functions that grow at most 
quadratically in terms of n. 


3.2.4. Remark. Let G be an X, Y-bigraph with n vertices and m edges. Since 
a’(G) < n/2, we find a maximum matching in G by applying Algorithm 3.2.1 
at most n/2 times. Each application explores a vertex of X at most once, just 
before marking it; thus it considers each edge at most once. If the time for 
one edge exploration is bounded by a constant, then this algorithm to find a 
maximum matching runs in time O(nm). Theorem 3.2.22 presents a faster 
algorithm, with running time O(./nm). Section 3.3 discusses a good algorithm 
for maximum matching in general graphs. | 


) 
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WEIGHTED BIPARTITE MATCHING 


um 


Our results on maximum matching generalize to weighted X, Y-bigraphs, 
where we seek a matching of maximum total weight. If our graph is not all of 
K,n, then we insert the missing edges and assign them weight 0. This does not 
affect the numbers we can obtain as the weight of a matching. Thus we assume 
that our graph is K,, n. 

Since we consider only nonnegative edge weights, some maximum weighted 
matching is a perfect matching; thus we seek a perfect matching. We solve both 
the maximum weighted matching problem and its dual. 


3.2.5. Example. Weighted bipartite matching and its dual. A farming company 
owns n farms and n processing plants. Each farm can produce corn to the 
capacity of one plant. The profit that results from sending the output of farm i 
to plant j is w;,;. Placing weight w;,; on edge x;y; gives us a weighted bipartite 
graph with partite sets X = {x;,;...,x,} and Y = {y,,...,y,}. The company 
wants to select edges forming a matching to maximize total profit. 

The government claims that too much corn is being produced, so it will pay 
the company not to process corn. The government will pay u; if the company 
agrees not to use farm i and vu; ifit agrees not to use plant j. Ifu;+v; < w;,;, then 
the company makes more by using the edge x;y; than by taking the government 
payments for those vertices. In order to stop all production, the government 
must offer amounts such that u; + v; > w;,; for alli, j. The government wants 
to find such values to minimize > u; + >> v;. i 
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3.2.6. Definition. A transversal of an n-by-n matrix consists of n positions, 
one in each row and each column. Finding a transversal with maximum 
sum is the Assignment Problem. This is the matrix formulation of the 
maximum weighted matching problem, where nonnegative weight w; ; 
is assigned to edge x;y; of K,,, and we seek a perfect matching M to maxi- 
mize the total weight w(M). 

With these weights, a (weighted) cover is a choice of labels u;, ..., uy 
and v;,..., uv, such that u; + v; > w;,; for alli, j. The cost c(u, v) of a cover 
(u,v) is }/u; + }.v;. The minimum weighted cover problem is that of 
finding a cover of rinimum cost. 


Note that the problem of minimum weight perfect matching can be solved 
using maximum weight matching; simply replace each weight w;,; with M — w;,; 
for some large number M. 

The next lemma shows that the weighted matching and weighted cover 
problems are dual problems. 


3.2.7. Lemma. For a perfect matching M and cover (u, v) in a weighted bipar- 
tite graph G, c(u, v) => w(M),. Also, c(u, v) = w(M) if and only if M consists 
of edges x;y; such that u; + v; = w;;. In this case, M and (u, v) are optimal. 


Proof: Since M saturates each vertex, summing the constraints u; + v; > wi,j 
that arise from its edges yields c(u, v) > w(M) for every cover (u,v). Further- 
more, if c(u, v) = w(M), then equality must hold in each of the n inequalities 
summed. Finally, since c(u,v) => w(M) for every matching and every cover, 
c(u, v) = w(M) implies that there is no matching with weight greater than 
c(u, v) and no cover with cost less than’w(M). a 


A matching and a cover have the same value only when the edges of the 
matching are covered with equality. This leads us to an algorithm. 


3.2.8. Definition. The equality subgraph G,, , for a cover (u, v) is the span- 
ning subgraph of K,,,, having the edges x;y; such that u; + vj = wj,;. 


If G,,, has a perfect matching, then its weight is }° u;+)_ v;, and by Lemma 
3.2.7 we have the optimal solution. Otherwise, we find a matching M and a ver- 
tex cover Q of the same size in G,,, (by using the Augmenting Path Algorithm, 
for example). Let R= QN X and T = QNY. Our matching of size |Q| consists 
of |R| edges from R to Y — T and |T| edges from T to X — R, as shown below. To 
seek a larger matching in the equality subgraph, we change (u, v) to introduce 
an edge from X — R to Y — T while maintaining equality on all edges of M. 

A cover requires u; + v; > w;,; for all i, j; the difference u; + v; — w;,; 1s 
the excess for i, j. Edges joining X — R and Y — T are not in G,,, and have 
positive excess. Let « be the minimum excess on the edges from X — R to Y —T. 
Reducing u; by ¢€ for all x; € X — R maintains the cover condition for these edges 
while bringing at least one into the equality subgraph. To maintain the cover 
condition for the edges from X — R to T, we also increase v, by ¢€ for y; € T. 
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We repeat the procedure with the new equality subgraph; eventually we 
obtain a cover whose equality subgraph has a perfect matching. The resulting 
algorithm was named the Hungarian Algorithm by Kuhn in honor of the 
work of Konig and Egervary on which it is based. 


—€ 
U S R 


T 
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3.2.9. Algorithm. (Hungarian Algorithm—Kuhn [1955], Munkres [1957]). 
Input: A matrix of weights on the edges of K,,,, with bipartition X,Y. 

Idea: Iteratively adjusting the cover (uv, v) until the equality subgraph G,,., has 
a perfect matching. 

Initialization: Let (u,v) be a cover, such as u; = max; w,_; and v; = 0. 
Iteration: Find a maximum matching M in G, ». If M is a perfect matching, 
stop and report M as a maximum weight matching. Otherwise, let Q be a 
vertex cover of size |M| in G,.,. Let R= XN QandT=YNQ. Let 


~ 


€ = min{u; + vj — wi joa; €X— RK, yy EY —T}. 


Decrease u; by € for x; € X — R, andi increase v; by € for y; € T. Form the new 
equality subgraph and repeat. a 


telegr 


We have presented the algorithm using bipartite graphs, but repeatedly 
drawing a changing equality subgraph is awkward. Therefore, we compute 
with matrices. The initial weights form a matrix A with w,_; in position i, 7. We 
associate the vertices and the labels (u,v) with the rows and columns, which 
serve as X and Y, respectively. We subtract w;_; from u; +; to obtain the excess 
matrix: c;,; = u; + v; — w;,;. The edges of the equality subgraph correspond to 
Os in the excess matrix. 


3.2.10. Example. Solving the Assignment Problem. The first matrix below is 
the matrix of weights. The others display a cover (u, v) and the corresponding 
excess matrix. We underscore entries in the excess matrix to mark a maximum 
matching M of G,.,, which appears as bold edges in the equality subgraphs 
drawn for the first two excess matrices. (Drawing the equality subgraphs is not 
necessary.) A matching in G,, corresponds to a set of Os in the excess matrix 
with no two in any row or column; call this a partial transversal. 

A set of rows and columns covering the Os in the excess matrix is a cover- 
ing set; this corresponds to a vertex cover in G,,. A covering set ot size less 
than n yields progress toward a solution, since the next weighted cover costs 
less. We study the Os in the excess matrix and find a partial transversal and a 
covering set of the same size. In a small matrix, we can do this by inspection. 


128 Chapter 3: Matchings and Factors 


We underscore the Os of a partial transversal, and we use Rs and Ts to 
label the rows and columns of the covering set. At each iteration, we compute 
the minimum excess on the positions not in a covered row or column (in rows 
X — Rand columns Y — 7). These uncovered positions have positive excess (the 
corresponding edges are not in the equality subgraph). The value « defined in 
Algorithm 3.2.9 is the minimum of these excesses. We reduce the label u; by « 
on rows not in R and increase the label v; by € on columns in T. 

In the example below, the covering set used in the first iteration reduces the 
cost of the cover but does not augment the maximum matching in the equal- 
ity subgraph. The second iteration produces a perfect matching. Using the 
last three columns as a covering set in the first iteration would augment the 
matching immediately. 

The transversal of 0s after the final iteration identifies a perfect matching 
whose total weight equals the cost of the final cover. The corresponding edges 
Ahave weights 5, 4, 6, 8,8 in the original data, which sum to 31. The labels 
4,5, 7, 4,6 and 0, 0, 2, 2, 1 in the final cover satisfy each edge exactly and also 
sum to 31. The value of the optimal solution is unique, but the solution itself is 
not; this example has many maximum weight matchings and many minimum 


cost covers, but all have total weight 31. | 
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3.2.11. Theorem. The Hungarian Algorithm finds a maximum weight match- 
ing and a minimum cost cover. 


Proof: The algorithm begins with a cover. It can terminate only when the 
equality subgraph has a perfect matching, which guarantees equal value for 
the current matching and cover. Suppose that (u, v) is the current cover and 
that the equality subgraph has no perfect matching. Let -(u’, v’) denote the 
new lists of numbers assigned to the vertices. Because € is the minimum of a 
nonempty finite set of positive numbers, € > 0. 
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We verify first that (u’, v’) is a cover. The change of labels on vertices of 
X — Rand T yields u; + v, = u; + v; for edges x;y; from X — R to T or from R to 
Y—T. Ifx; € Rand y; €T, then uv; +v; =u; + vj + €, and the weight remains 
covered. If x; « X — R and y; € Y.—T, then u; + v; equals u; + vj — €, which by 
the choice of ¢€ is at least wj,;. 

The algorithm terminates only when the equality subgraph has a perfect 
matching, so it suffices to show that it does terminate. Suppose that the weights 
w;,; are rational. Multiplying the weights by their least common denominator 
yields an equivalent problem with integer weights. We can now assume that 
the labels in the current cover also are integers. Thus each excess is also an 
integer, and at each iteration we reduce the cost of the cover by an integer 
amount. Since the cost starts at some value and is bounded below by the weight 
of'a perfect matching, after finitely many iterations we have equality. 

For real-valued weights in general, see Remark 3.2.12). | 


3.2.12.* Remark. When the weights are real numbers, the algorithm still 
works if we obtain vertex covers in the equality subgraph more carefully. We 
show that the algorithm terminates within n? iterations. Because the edges 
of M remain in the new equality subgraph, the size of the current matching 
never decreases. Since the size of the matching can increase at most n times, it 
suffices to show that it must increase within n iterations. 

If we find the maximum matching M by iterating the Augmenting Path 
Algorithm, then the last iteration presents us with a vertex cover. We find it by 
exploring M-alternating paths from the set U of M-unsaturated vertices in X. 
With S and T denoting the sets of vertices reachable in X and T, we obtain the 
vertex cover RUT, where R = X —S. © 

Applying a step of the Hungarian Algorithm using the vertex cover R UT 
maintains equality on M and all the edges in M-alternating paths from U. 
Edges from T to R disappear from the equality subgraph, but we don’t care 
because they don’t appear in M-alternating paths from U. Introducing an edge 
from S to Y — T either creates an M-augmenting path or increases T while 
leaving U unchanged. Since we can increase T at most n times, we obtain a 
larger matching in the equality subgraph within n iterations. si 


3.2.13.* Remark. The maximum matching and vertex cover problems in bi- 
partite graphs are special cases of the weighted problems. Given a bipartite 
graph G, form a weighted graph with weight 1 on the edges of G and weight 0 
on the edges of K,,,. The maximum weight of a matching is a’(G). 

Given integer weights, the Hungarian algorithm always maintains integer 
labels in the weighted cover. Hence in this weighted cover problem we may 
restrict the values (labels) used to be integers. Further thought shows that 
these integers will always be 0 or 1. 

The vertices receiving label 1 must cover the weight on the edges of G, so 
they form a vertex cover for G. Minimizing the sum of labels under the integer 
restriction is equivalent to finding the minimum number of vertices in a vertex 
cover for G. Hence the answer to the weighted cover problem is B(G). | 
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3.2.14.* Application. Street Sweeping and the Transportation Problem. A 
cleaning machine sv eeping a curb must move in the same direction as traffic. 
This yields a digraph; a two-way street generates two oppositely directed edges, 
while a one-way street generates two edges in the same direction. We consider 
a simple version of the Street Sweeping Problem, discussed in more detail 
in Roberts [1978] as based on Tucker—Bodin [1976]. 

In New York City, parking is prohibited from sorne curbs each day to allow 
for street sweeping. For each day, this defines a sweep subgraph G of the full 
digraph H of curbs, consisting of those available for sweeping. Each e € E(H) 
has a deadheading time t¢(e) needed to travel it without sweeping. 

The question is how to sweep G while minimizing the total deadheading 
time spent without sweeping. This is 2 generalization of a directed version of 
the Chinese Postman Problem. If indegree equals outdegree at each vertex of 
G, then no deadheading is needed. Otherwise, we duplicate edges of G or add 
edges from H to obtain an Eulerian digraph G’ containing G. 

Let X be the set of vertices with excess indegree; let o(x) = d, (x) — ae (x) 
for x ¢ X. Let ¥ be the set with. excess outdegree; let 0(y) = dg(y) — dé (y) for 
y €Y. Note that >) ..y 9%) = Do) cy 9(y). To obtain G’ from G, we must add 
a(x) edges with tails at x € X and d(y) edges with heads at y € Y. Since G’ 
needs net outdegree 0 at each vertex, the additions form paths from X to Y. The 
cost c(xy) of an x, y-path is the distance from x to y in the weighted digraph H, 
which can be found by Dijkstra’s Algorithm. 

This yields the Transportation Problem. Given supply o(x) for x € X, 
demand 4@(y) for y € Y, cost c(xy) per unit sent from x to y, and )-o(x) = 
>_ d(y), we want to satisfy the demands at least total cost. A version of the 
problem was introduced by Kantorovich [1939]; the form above arose (with a 
constructive solution) in Hitchcock [1941] (see also Koopmans [1947]). The 
problem is discussed at length in Ford—Fulkerson [1962, p93—130]. 

When the supplies and demands are rationai, the Assignment Problem 
can be applied. First scale up to obtain integer supplies and demands. Next 
define a matrix with }>o(x) rows and columns. For each x € X, create a(x) 
rows. For each y € Y, create 5(y) columns. When row i and-column / represent 
x and y, let w;; = M —c(xy), where M = max,,c(xy). A maximum weight 
matching now yields a minimum cost solution to the Transportation Problem. 
A generalization of the Transportation Problem appears in Section 4.3. | 


STABLE MATCHINGS (optional) 


Instead of optimizing total weight for a matching, we may try to optimize 
using preferences. Given n men and n women; we want to establish n “stable” 
marriages. If man x and woman a are paired with other partners, but x prefers 
a to his current partner and a prefers x to her current partner, then they might 
leave their current partners and switch to each other. In this situation we say 
that the unmatched pair (x, a) is an unstable pair. 
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3.2.15. Definition. A perfect matching is a stable matching if it yields no 
unstable unmatched pair. 


3.2.16. Example. Given men x, y, z, w, women a, b,c, d, and preferences listed 
below, the matching {xa, yb, zd, wc} is a stable matching. a 


Men {x, y,z,w} Women {a, b,c, d} 
x:a>b>c>d a:z>x>y>w 
y:a>c>b>d b:y>wr>x>z 
zic>d>a>b c:w>x>y>z 
wic>b>a>d d:x>y>z>w 


In their paper “College ad:nissions and the stability of marriage”, Gale and 
Shapley proved that a stable matching always exists and can be found using 
a relatively simple algorithm. In the algorithm, men and women do not play 
symmetric roles; we will discuss this importance of this difference later. The 
algorithm below generates the matching of Example 3.2.16. 


3.2.17. Algorithm. (Gale—Shapley Proposal Algorithm) 

Input: Preference rankings by each of n men and n women. 

Idea: Produce a stable matching using proposals by maintaining information 
about who has proposed to whom and who has rejected whom. 

Iteration: Each man proposes to the highest woman on his preference list who 
has not previously rejected him. If each woman receives exactly one proposal, 
stop and use the resulting matching. Otherwise, every woman receiving more 
than one proposal rejects all of them. except the one that is highest on her 
preference list. Every woman receiving a proposal says “maybe” to the most 
attractive proposal received. a 


3.2.18. Theorem. (Gale—Shapley [1962]) The Proposal Algorithm produces a 
stable matching. 


Proof: The algorithm terminates (with some matching), because on each non- 
terminal iteration, the total length of the lists of potential mates for the men 
decreases. This can happen only n? times. 

Key Observation: the sequence of proposals made by each man is nonin- 
creasing in his preference list, and the sequence of men to whom a woman says 
“maybe” is nondecreasing in her preference list, culminating in the man as- 
signed. This holds because men propose repeatedly to the same woman until 
rejected, and women say “maybe” to the same man until a better offer arrives. 

If the result is not stable, then there is an unstable unmatched pair (x, a), 
with x matched to b and y matched to a. By the key observation, x never 
proposed to a during the algorithm, since a received a mate less desirable than 
x. The key observation also implies that x would not have proposed to b without 
earlier proposing to a. This contradiction confirms the stability of the result. & 


The asymmetry of the proposal algorithm suggests asking which sex is 
happier. When the first choices of the men are distinct, they all get their first 
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choice, and the women are stuck with whomever proposed. When the algorithm 
runs with women proposing, every woman is at least as happy as when men 
do the proposing, and every man is at least as unhappy. In Example 3.2.16, 
running the algorithm with women proposing immediately yields the match- 
ing {xd, yb, ca, wc}, in which all women are matched to their first choices. In 
fact, among all stable matchings, every man is happiest in the one produced by 
the male-proposal algorithm, and every woman is happiest under the female- 
proposal algorithm (Exercise 11). Societal conventions thus favor men. 

The algorithm is used in another setting. Each year, the graduates of med- 
ical schools submit preference lists of hospitals where they wish to be residents. 
The hospitals have their own preferences; we model a hospital with multiple 
openings as several hospitals with the same preference list. Chaos in the mar- 
ket for residents (then called interns) forced hospitals to devise and implement 
the algorithm ten years before the Gale-Shapley paper defined and solved the 
problem! The result was the National Resident Matching Program, a non- 
profit corporation established in 1952 to provide a uniform appointment date 
and matching procedure. 

Who is happier with the outcome? Since the medical organizations ran the 
algorithm, it is not surprising that initially they did the proposing and were 
happier with the outcome. The distinetion is even clearer in another setting; 
students applying for jobs have preferences, but the employers make the pro- 
posals, called “job offers”. Unhappiness with the NRMP caused the system 
to be changed in 1998 to a student-proposing algorithm. In 1998 the system 
processed 35,823 applicants for 22,451 positions. Additional details about the 
system can be found at nrmp.aamc.org/nrmp/mainguid/ on the World Wide Web. 

There may be stable matchings other than those found by the two versions 
of the proposal algorithm. To seek a “fair” stable matching, we could give each 
person a number of points with which to rate preferences. The weight for the 
pair xa is then the sum of the points that x gives to a and a gives to x. The 
Hungarian Algorithm would yield a matching of maximum total weight, but 
this might not be a stable matching (Exercise 10). Other approaches appear 
in the books Knuth [1976] and Gusfield—Irving [1989], which discuss stable 
marriages and related topics. 


FASTER BIPARTITE MATCHING (optional) 


We began this section with an algorithm for finding maximum matchings 
in bipartite graphs. The running time can be improved by seeking augmenting 
paths in a clever order; when short augmenting paths are available, we needn’t 
explore many edges to find one. Using a Breadth-First Search simultaneously 
from all the unsaturated vertices of X, we can find many paths of the same 
length with one examination of the edge set. Hopcroft and Karp [1973] proved 
that subsequent augmentations must use longer paths, so the searches can be 
grouped in phases finding paths of the same lengths. They combined these 
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ideas to show that few phases are needed, enabling maximum matchings in 
n-vertex bipartite graphs to be found in O(n”) time. 


3.2.19. Remark. If M isa matching of size r and M* is a matching of size s > r, 
then there exist at least s —r vertex-disjoint M-augmenting paths. At least this 
many such paths can be found in MaM*. | 


The next lemma implies that the sequence of path lengths in successive 
shortest augmentations is nondecreasing. Here we treat paths as sets of edges, 
and cardinality indicates number of edges. 


3.2.20.Lemma. If P is a shortest M-augmenting path and P’ is MaP- 
augmenting, then |P’| > |P|+2|PN P’| (treating P as an edge set). 


Proof: Note that MaP is the matching obtained by using P to augment M. Let 
N be the matching (MaP)aP’ obtained by using P’ to augment MaP. Since 
|N| = |M| +2, Remark 3.2.19 guarantees that MaN contains two disjoint M- 
augmenting paths P; and P:. Each of these is at least as long as P, since P is 
a shortest M-augmentating path. 

Since N is obtained from M by switching the edges in P and then switch- 
ing the edges in P’, an édge belongs to exactly one of M and N if and only if 
it belongs to exactly one of P and P’.” Therefore, MaN = PaP’. This yields 
|PaP’|>|Pil+|P2|>2|/P|. Thus — 


2|P| <|PaP’| =|P|+|P’|-2|PP'. 
We conclude that |P’| > |P|+2/PN P|. a 
3.2.21. Lemma. If P), Po,--- is a list of successive shortest augmentations, 


then the augmentations of the same length are vertex-disjoint paths. 


Proof: We use the method of contradiction. Let P,, P; with 1 > k be a closest 
pair in the list that have the same size but are not vertex-disjoint. By Lemma 
3.2.20, the lengths of successive shortest augmenting paths are nondecreasing, 
so P,,..., P; all have the same length. Since P,, P; is a closest intersecting pair 
with the same length, the paths P,.1,..., P; are pairwise disjoint. 

Let M’ be the matching given by the augmentations P,..., P,. Since 
Py41,..., Py are pairwise disjoint, P; is an M’-augmenting path. By Lemma 
3.2.20, |P;| > |Py| + |P; O P|. Since |P;| = | P|, there is no common edge. 

On the other hand, there must be a common edge. Each vertex of P, is 
saturated in M’ using an edge of P,, and every vertex of.an M’-augmenting 
path P; that is saturated in M’ (such as a vertex common to P; and P,) must 
contribute its saturated edge to P;. 

The contradiction implies that there is no such pair P. P,. | 


3.2.22. Theorem. (Hopcroft—Karp [1973]) The breadth-first phased maximum 
matching algorithm. runs in O(,/nm) time on bipartite graphs with n ver- 
tices and m edges. 


134 Chapter 3: Matchings and Factors 


Proof: By Lemmas 3.2.20-3.2.21, searching simultaneously from all unsatu- 
rated vertices of X for shortest augmentations yields vertex-disjoint paths, after 
which all other augmenting paths are longer. Hence the augmentations of each 
length are found in one examination of the edge set, running in time O(m). It 


suffices to prove that there are at most 2 lV n/ 2| + 2 phases. 


List the augmenting paths as P;,..., P, in order by length, with s = a/(G) < 
n/2. Since paths of the same length are vertex-disjoint, each P;,1 is an augment- 
ing path for the matching M; formed by using P;,..., P;. It suffices to prove the 
more general statement that whenever P;,..., P, are successive shortest aug- 
menting paths that build a maximum matching, the number of distinct lengths 
among these paths is at most 2| /s| + 2. 

Let r = [s — al SI Because |M,| = r and the maximum matching has size 
s, Remark 3.2.19 yields at least s — r vertex-disjoint M,-augmenting paths. 
The shortest of these paths uses at most |r/(s —r)| edges from M,. Hence 
|Prail < 2 Lr/(s —r)} +1. Since [r/(s —r)} < |s/[/s]]| < | /s], the paths up to 
P, provide all but the last | /s ] augmentations using length at most 2 | /s|+1. 
There are at most | Vs | +1 distinct odd integers up to this value, and even if the 
last | /s] paths have distinct lengths, they provide at most | /s| +1 additional 
lengths, so altogether we use at most 2|/s| + 2 distinct lengths. a 


Even and Tarjan [1975] extended this to solve in time O(./nm) a more 
general problem that includes maximum bipartite matching. 


I 


telegram 


EXERCISES 


3.2.1. (—) Using nonnegative edge weights, construct a 4-vertex weighted graph in 
which the matching of maximum weight is not a matching of maximum size. 


3.2.2. (—) Show how to use the Hungarian Algorithm to test for the existence of a perfect 
matching in a bipartite graph. 


3.2.3. (*—) Give an example of the stable matching problem with two men and two 
women in which there is more than one stable matching. 


3.2.4. (*—) Determine the stable matchings resulting from the Proposal Algorithm: run 
with men proposing and with women proposing, given the preference lists below. 


Men {u, v, w, x, y, z} Women {a, b,c, d,e, f} 
:a>b>d>c>f>re :Z>x>yru>v>w 
:a>b>c>f>e>d :y>z>w>x>u>u 
:c>b>d>a>f>e civ>x>w>yru>z 
:c>a>d>b>e>f >:w>y>u>x>z>0v 
:c>d>a>b>f>e :u>u>x>w>y>z 
:d>e>f>c>b>a :u>w>x>v>z>y 


Ne &® &e & 
Se Woes 
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3.2.5. Find a transversal of maximum total sum (weight) in each matrix below. Prove 
that there is no larger weight transversal by exhibiting a solution to the dual problem. 
Explain why this proves that there is no larger transversal. 


(a) (b) (c) 


44436 78987 12345 
11434 87676 67872 
14535 96546 13445 
56479 85764 36287 
53683 76555 41354 


3.2.6. Find a minimum-weight transversal in the matrix below, and use duality to prove 
that the solution is optimal. (Hint: Use a transformation of the problem.) 


4 5 8 10 11 
76 5 7 4 
8 5 12 9 6 
6 6 13 10 7 
45 7 9 8 


3.2.7. The Bus Driver Problem. Let there be n bus drivers, n morning routes with 
durations x),...,X,, and n afternoon routes with durations y;,..., y,. A driver is paid 
overtime when the morning route and afternoon route exceed total time t. The objective 
is to assign one morning run and one afternoon run to each driver to minimize the total 
amount of overtime. Express this as a weighted matching problem. Prove that giving 
the ith longest morning route and ith shortest afternoon route to the same driver, for 
each i, yields an optimal solution. (Hint: Do not use the Hungarian Algorithm; consider 
the special structure of the matrix.) (R.B. Potts) 

3.2.8. Let the entries in matrix A have the form w; ; = a;b;, where aj, ... , a, are numbers 
associated with the rows and },,...,b, are numbers associated with the columns. De- 
termine the maximum weight of a transversal of A. What happens when w,,; = a; + b;? 
(Hint: In each case, guess the general pattern by examining the solution when n = 2.) 


3.2.9. («) A mathematics department offers k seminars in different topics to its n stu- 
dents. Each student will take one seminar; the ith seminar will have k; students, where 
>> k; =n. Each student submits a preference list ranking the k seminars. An assign- 
ment of the students to seminars is stable if no two students can both obtain more 
preferable seminars by switching their assignments. Show how to find a stable assign- 
ment using weighted bipartite matching. (Isaak) 


3.2.10. («) Consider n men and n women, each assigning n — i points to the ith person 
in his or her preference list. Let the weight of a pair be the sum of the points assigned 
by those two people. Construct an example where no maximum weight matching is a 
stable matching. 


3.2.11. («!) Prove that if man x is paired with woman a in some stable matching, then a 
does not reject x in the Gale-Shapley Proposal Algorithm with men proposing. Conclude 
that among all stable matchings, every man is happiest in the matching produced by 
this algorithm. (Hint: Consider the first occurrence of such a rejection.) 


3.2.12. («) In the Stable Roommates Problem, each of 2n people has a preference order- 
ing on the other 2n — 1 people. A stable matching is a perfect matching such that no 
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unmatched pair prefers each other to their current roommates. Prove that there is no 
stable matching when the preferences are those below. (Gale—Shapley [1962]) 


a:b>c>d 
b:c>a>d 
c:a>b>d 
d:a>b>c 


3.2.13. (*) In the stable roommates problem, suppose that each individual declares a 
top portion of the preference list as “acceptable”. Define the acceptability graph to be 
the gruph whose vertices are the people and whose edges are the pairs of people who 
rank each other as acceptable. Prove that all sets of rankings with acceptability graph 
G lead to a stable matching if and only if G is bipartite. (Abeledo—Isaak [1991]). 


3.3. Matchings in General Graphs 


When discussing perfect matchings in graphs, it is natural to consider more 
general spanning subgraphs. 


3.3.1. Definition. A factor of a graph G is a spanning subgraph of G. A k- 
factor is a spanning k-regular subgraph. An odd component of a graph 
is a component of odd order; the number of odd components of H is o(H). 


3.3.2. Remark. A 1-factor and a perféct matching are almost the same thing. 
The precise distinction is that “1-factor” is a spanning 1-regular subgraph of G, 
while “perfect matching” is the set of edges in such a subgraph. 

A 3-regular graph that has a perfect matching decomposes into a 1-factor 
and a 2-factor. a 


TUTTE’S 1-FACTOR THEOREM 


Tutte found a necessary and sufficient condition for which graphs have 1- 
factors. If G has a 1-factor and we consider a set § C V(G), then every odd 
component of G — S has a vertex matched to something outside it, which can 
only belong to S. Since these vertices of § must be distinct, o(G — S) < |S]. 


OC O 


even even 


The condition “For all § C V(G), 0(G — S) < |S|” is Tutte’s Condition. 
Tutte proved that this obvious necessary condition is also sufficient (TONCAS). 
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Many proofs are known, such as Exercise 13 and Exercise 27. We present the 
proof by Lovasz using the ideas of symmetric difference and extremality. 


3.3.3. Theorem. (Tutte [1947]) A graph G has a 1-factor if and only if o(G — 
S) < |S| for every S C V(G). 


Proof: (Lovasz [1975]). Necessity. The odd components of G — S must have 
vertices matched to distinct vertices of S. 

Sufficiency. When we add an edge joining two components of G — S, the 
number of odd components does not increase (odd and even together become 
one odd component, two components of the same parity become one even compo- 
nent). Hence Tutte’s Condition is preserved by addition of edges: if G’ = G+e 
and S C V(G), then o0(G’ — S) < o(G — S) < |S|. Also, if G’ = G +e has no 
1-factor, then G has no 1-factor. 

Therefore, the theorem holds unless there exists a simple graph G such 
that G satisfies Tutte’s Condition, G has no 1-factor, and adding any missing 
edge to G yields a graph with a 1-factor. Let G be such a graph. We obtain a 
contradiction by showing that G actually does contain a 1-factor. 

Let U be the set of vertices in G that have degree n(G) — 1. 

Case 1: G—U consists of disjoint complete graphs. In this case, the vertices 
in each component of G — U can be paired i in any way, with one extra in the 
odd components. Since o(G — U) < |UJ and each vertex of U is adjacent to all 
of G — U, we can match the leftover vertices to vertices of U. 

The remaining vertices are in U , which i is a clique. To complete the 1- 
factor, we need only show that an even number of vertices remain in U. We 
have matched an even number, so it suffices to show that n(G) is even. This 
follows by invoking Tutte’s Condition for S = @, since a graph of odd order 
would have a component of odd order. 


Siolaie 


U 


Case 2: G — U is not a disjoint union of cliques. In this case, G — U has 
two vertices at distance 2; these are nonadjacent vertices x, z with a common 
neighbor y ¢ U (Exercise 1.2.23a). Furthermore, G — U has another vertex w 
not adjacent to y, since y ¢ U. By the choice of G, adding an edge to G creates 
a 1-factor; let M, and Mp2 be 1-factors in G + xz and G + yw, respectively. It 
suffices to show that M,aMp contains a 1-factor avoiding xz and yw, because 
this will be a 1-factor in G. 
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Let F = M,iaMg. Since xz € M; — Mz and yw € Mz — Mj, both xz and yw 
are in F. Since every vertex of G has degree 1 in each of M; and Mg, every 
vertex of G has degree 0 or 2 in F. Hence the components of F are even cycles 
and isolated vertices (see Lemma 3.1.9). Let C be the cycle of F containing xz. 

If C does not also contain yw, then the desired 1-factor consists of the edges 
of M, from C and all of M, not in C. 

If C contains both yw and xz, as shown below, then to avoid them we use 
yx or yz. In the portion of C starting from y along yw, we use edges of M, to 
avoid using yw. When we reach {x, z}, we use zy if we arrive at z (as shown); 
otherwise, we use xy. In the remainder of C we use the edges of Mz. We have 
produced a 1-factor of C that does not use xz or yw. Combined with M, or Mo 
outside C, we have a 1-factor of G. a 


3.3.4. Remark. Like other characterization theorems (such as Theorem 1.2.18 
and Theorem 3.1.11)), Theorem 3.3.3 yields short verifications both when the 
property holds and when it doesn’t. We prove that G has a 1-factor exists by 
exhibiting one. When it doesn’t exist, Theorem 3.3.3 guarantees that we can 
exhibit a set whose deletion leaves too many odd components. a 


3.3.5. Remark. For a graph G and any S € V(G), counting th 2 vertices modulo 
2 shows that |5S|+0(G—S) has the same parity as n(G). Thus also the difference 
o(G — S) — |S| has the same parity as n(G). We conclude that if n(G) is even 
and G has no 1-factor, then o(G — S) exceeds |S| by at least 2 for some S. g 


For non-bipartite graphs (such as odd cycles), there may be a gap between 
a'(G) and B(G) (see also Exercise 10). Nevertheless, another minimization 
problem yields a min-max relation for a'(T) in general graphs. This min-max 
relation generalizes Remark 3.3.5. The proof uses a graph transformation that 
involves a general graph operation. 


3.3.6. Definition. The join of simple graphs G and H, written G v H, is the 


graph obtained from the disjoint union G + H by adding the edges {xy : x € 
V(G),y € V(t)}. 


Py V Kg 
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3.3.7. Corollary. (Berge—-Tutte Formula—Berge [1958]) The largest number 
of vertices saturated by a matching in G is minscyg){n(G) — d(S)}, where 
d(S) = 0(G — S) —|S|. 

Proof: Given S C V(G), at most |S] edges can match vertices of S to vertices 

in odd components of G — S, so every matching has at least o(G — S) — |S| 

unsaturated vertices. We want to achieve this bound. 
Let d = max{o(G — S) — |S|: S C V(G)}. The case S = @ yields d > 0. Let 

G’ =GV Kg. Since d(S) has the same parity as n(G) for each S, we know that 

n(G’) is even. If G’ satisfies Tutte’s Condition, then we obtain a matching of the 

desired size in G from a perfect matching in G’, because deleting the d added 

vertices eliminates edges that saturate at most d vertices of G. 

The condition 0(G’ — S’) < |S’| holds for S’ = @ because n(G’) is even. 

If S’ is nonempty but does not contain all of K,, then G’ — S’ has only one 

component, and 1 < |s’|. Finally, when Kz C S’, we let S = S’ — V(K,). We 

have G’ — S’ = G—S, so 0(G' — S') = o(G — S) < |S| +d = |S’|. We have verified 

that G’ satisfies Tutte’s Condition. a 


= 
a= 
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Corollary 3.3.7 guarantees that there is a short PROOF that a maximum 
matching indeed has maximum size by exhibiting a vertex set S whose deletion 
leaves the appropriate number of odd components. 

Most applications of Tutte’s Theorem involve showing that some other con- 
dition implies Tutte’s Condition and hence guarantees a 1-factor. Some were 
proved by other means long before Tutte’s Theorem was available. 


3.3.8. Corollary. (Petersen [1891]) Every 3-regular graph with no cut-edge has 
a 1-factor. 


Proof: Let G be a 3-regular graph with no cut-edge. We prove that G satisfies 
Tutte’s Condition. Given S C V(G), we count the edges between S and the odd 
components of G — S. Since G is 3-regular, each vertex of S is incident to at 
most three such edges. If each odd component H of G — S is incident to at least 
three such edges, then 30(G — S) < 3[{S| and hence o(G — S) < |S|, as desired. 
Let m be the number of edges from S to H. The sum of the vertex degrees 
in H is 8n(H) — m. Since H is a graph, the sum of its vertex degrees must be 
even. Since n(/) is odd, we conclude that m must also be odd. Since G has 
no cut-edge, m cannot equal 1. We concludethat there are at least three edges 
from S to H, as desired. | 


Proof by contradiction would also be natural here. Assuming 0(G — S) > |S| 
also leads to 0(G — S) < |S|, so we rewrite the proof directly. Corollary 3.3.8 
is best possible; the Petersen graph satisfies the hypothesis but does not have 
two edge-disjoint 1-factors (Petersen [1898]). 
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Petersen also proved a sufficient condition for 2-factors. A connected graph 
with even vertex degrees is Eulerian (Theorem 1.2.26) and decomposes into 
edge-disjoint cycles (Proposition 1.2.27). For regular graphs of even degree, the 
cycles in some decomposition can be grouped to form 2-factors. 


3.3.9. Theorem. (Petersen [1891]) Every regular graph of even degree has a 
2-factor. 


Proof: Let G be a 2k-regular graph with vertices v;,..., v,. Every component 
of G is Eulerian, with some Eulerian circuit C. For each component, define a 
bipartite graph H with vertices u..... u, and w1,...,w, by putting uj  w,; 
if v; immediately follows v; somewhere on C. Because C enters and exits each 
vertex k times, H is k-regular. (Actuaiiy, H is the split of the digraph obtained 
by orienting G in accordance with C—see Definition 1.4.20.) 

Being a regular bipartite graph, H has a 1-factor M (Corollary 3.1.13). The 
edge incident to w; in H corresponds to an edge entering v; in C. The edge 
incident to u; in H corresponds to an edge exiting v;. Thus the 1-factor in H 
transforms into a 2-regular spanning subgraph of this component of G. Doing 
this for each component of G yields a 2-factor of G. a 


3.3.10. Example. Construction of a 2-factor. Consider the Eulerian circuit in 
G = K; that successively visits 1231425435. The corresponding bipartite graph 
H is on the right. For the 1-factor whose u, w-pairs are 12, 43, 25, 31, 54, the 
resulting 2-factor is the cycle (1, 2, 5, 4, 3). The remaining edges form another 
1-factor, which corresponds to the 2-factor (1, 4,2,3,5) that remainsinG. & 
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f-FACTORS OF GRAPHS (optional) 


A factor is a spanning subgraph of G; we ask about existence of factors of 
special types. A k-factor is a k-regular factor; we have studied 1-factors and 
2-factors. We can try to specify the degree at each vertex. 


3.3.11. Definition. Given a function f: V(G) — NU{0}, an f-factor ofa graph 
G is a subgraph A such that dy(v) = f(v) for all v € V(G). 


Tutte [1952] proved a necessary and sufficient condition for a graph G to 
have an f-factor (see Exercise 29). He later reduced the problem to checking for 
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a 1-factor in a related simple graph. We describe this reduction; it is a beautiful 
example of transforming a graph problem into a previously solved problem. 


3.3.12. Example. A graph transformation (Tutte [1954a]). We assume that 
f(w) < d(w) for all w; otherwise G has too few edges at w to have an f-factor. 
We then construct a graph H that has a 1-factor if and only if G has an f-factor. 
Let e(w) = d(w) — f(w); this is the excess degree at w and is nonnegative. 

To construct H, replace each vertex v with a biclique Kav) -(.) having partite 
sets A(v) of size d(v) and B(v) of size e(v). For each vw € E(G), add an edge 
joining one vertex of A(v) to one vertex of A(w). Each vertex of A(v) participates 
in one such edge. 

The figure below shows a graph G, vertex labels given by f, and the re- 
sulting simple graph H. The bold edges in H form a 1-factor that corresponds 


to an f-factor of G. In this example, the f-factor is not unique. | 
3 1 
G es ¢ 
2 1 ¢ 


3.3.13. Theorem. A graph G has an f-factor if and only if the graph H con- 
structed from G and f as in Example 3.3.12 has a 1-factor. 


Proof: Necessity. If G has an f-factor, then the corresponding edges in H leave 
e(v) vertices of A(v) unmatched; match them arbitrarily to the vertices of B(v) 
to obtain a 1-factor of H. 

Sufficiency. From a 1-factor of H, deleting B(v) and the vertices of A(v) 
matched into B(v) leaves f(v) edges at v. Doing this for each v and merging the 
remaining f(v) vertices of each A(v) yields a subgraph of G with degree f(v) 
at v. Itis an f-factor of G. | 


Tutte’s Condition for a 1-factor in the derived graph H of Example 3.3.12 
transforms into a necessary and sufficient condition for an f-factor in G. Among 
the applications is a proof of the Erdés—Gallai [1960] characterization of degree 
sequences of simple graphs (Exercise 29). 

Given an algorithm to find a 1-factor, the correspondence in Theorem 3.3.13 
provides an algorithmic test for an f-factor. Instead of just seeking a 1-factor 
(that is, a perfect matching), we next consider the more genera! problem of 
finding a maximum matching in a graph. 
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EDMONDS’ BLOSSOM ALGORITHM (optional) 


Berge’s Theorem (Theorem 3.1.10) states that a matching M in G has max- 
imum size if and only if G has no M-augmenting path. We can thus find a 
maximum matching using successive augmenting paths. Since we augment 
at most n/2 times, we obtain a good algorithm if the search for an augment- 
ing path does not take too long. Edmonds [1965a] presented the first such 
algorithm in his famous paper “Paths, trees, and flowers”. 

In bipartite graphs, we can search quickly for augmenting paths (Algorithm 
3.2.1) because we explore from each vertex at most once. An M-alternating path 
from u can reach a vertex x in the same partite set as u only along a saturated 
edge. Hence only once can we reach and explore x. This property fails in graphs 
with odd cycles, because M-alternating paths from an unsaturated vertex may 
reach x both along saturated and along unsaturated edges. 


3.3.14. Example. In the graph below, with M indicated in bold, a search for 
shortest M-augmenting paths from u reaches x via the unsaturated edge ax. If 
we do not also consider a longer path reaching x via a saturated edge, then we 
miss the augmenting path u, v, a,b,c, d,x, y. | 
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We describe Edmonds’ solution to this difficulty. If an exploration of M- 
alternating paths from u reaches a vertex x by an unsaturated edge in one 
path and by a saturated edge in another path, then x belongs to an odd cycle. 
Alternating paths from u can diverge only when the next edge is unsaturated 
(leaving vertex a in Example 3.3.14); when the next edge is saturated there 
is only one choice for it. From the vertex where the paths diverge, the path 
reaching x on an unsaturated edge has odd length, and the path reaching it on 
a saturated edge has even length. Together, they form an odd cycle. 


3.3.15. Definition. Let M be a matching in a graph G, and let u be an M- 
unsaturated vertex. A flower is the union of two M-alternating paths 
from u that reach a vertex x on steps of opposite parity (having not done 
so earlier). The stem of the flower is the maximal common initial path 
(of nonnegative even length). The blossom of the flower is the odd cycle 
obtained by deleting the stem. 


In Example 3.3.14, the flower is the full graph except y, the stem is the path 
u, v, a, and the blossom is the 5-cycle. The horticultural terminology echoes the 
use of tree for the structures given by most search procedures. 
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Blossoms do not impede our search. For each vertex z in a blossom, some 
M-alternating u, z-path reaches z on a saturated edge, found by traveling the 
proper direction around the blossom to reach z from the stem. We therefore can 
continue our search along any unsaturated edge from the blossom to a vertex 
not yet reached. Example 3.3.14 shows such an extension that immediately 
reaches an unsaturated vertex and completes an M-augmenting path. 

Since each vertex of a blossom is saturated by an edge on these paths, no 
saturated edge emerges from a blossom (except the stem). The effect of these 
two observations is that we can view the entire blossom as a single “superver- 
tex” reached along the saturated edge at the end of the stem. We search from all 
vertices of the supervertex blossom simultaneously along unsaturated edges. 

We implement this consolidation by contracting the edges of a blossom B 
when we find it. The result is a new saturated vertex b incident to the last 
(saturated) edge of the stem. Its other incident edges are the unsaturated edges 
joining vertices of B to vertices outside B. We explore from b in the usual way 
to extend our search. We may later find another blossom containing b; we 
then contract again. If we find an M-alternating path in the final graph from 
u to an unsaturated vertex x, then we can undo the contractions to obtain an 
M-augmenting path to x in the original graph. 

Except for the treatment of blossoms, the approach is that of Algorithm 
3.2.1 for exploring M-alternating paths. In the corresponding phrasing, T is 
the set of vertices of the current graph-reached along unsaturated edges, and S$ 
is the set of vertices reached along saturated edges. The vertices that arise by 
contracting blossoms belong to S. 


1m.m 
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3.3.16. Example. Let M be the bold matching in the graph on the left below. 
We search from the unsaturated vertex u for an M-augmenting path. We first 
explore the unsaturated edges incident to u, reaching a and b. Since a and 
b are saturated, we immediately extend the paths along the edges ac and bd. 
Now S = {u, c,d}. If we next explore from c, then we find its neighbors e and 
f along unsaturated edges. Since ef € M, we discover the blossom with vertex 
set {c, e, f}. We contract the blossom to obtain the new vertex C, changing S to 
{u, C,d}. This yields the graph on the right. 
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Suppose we now explore from the vertex C € S. Unsaturated edges take 
us to g and to d. Since g is saturated by the edge gh, we place A in S. Since 
d is already in S, we have found another blossom. The paths reaching d are 
u,b,d and u,a, C,d. We contract the blossom, obtaining the new vertex U and 
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the graph on the right below, with S = {U,h}. We next explore from h, finding 
nothing new (if we exhaust S without reaching an unsaturated vertex, then 
there is no M-augmenting path from u). Finally, we explore from U, reaching 
the unsaturated vertex x. 
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Having recorded the edge on which we reached each vertex, we can extract 
an M-augmenting u, x-path. We reached x from U, so we expand the blossom 
back into {u, a, C, d, b} and find that x is reached from U along bx. The path in 
the blossom U that reaches b on a saturated edge ends with C, d, b. Since C is 
a blossom in the original graph, we expand C back into {c, f, e}. Note that d is 
reached from C by the unsaturated edge ed. The path from the “base” of C that 
reaches e along a saturated edge is c, f, e. Finally, c was reached from a and a 
from u, so we obtain the full 4 -augmenting path u,a,c, f,e,d, b,x. | 


We summarize the steps of the algorithm, glossing over the details of im- 
plementation, especially the treatment of contractions. 


3.3.17. Algorithm. (Edmonds’ Blossom Algorithm [1965a]—sketch). 

Input: A graph G, a matching M in G, an M-unsaturated vertex u. 

Idea: Explore M-alternating paths from u, recording for each vertex the vertex 
from which it was reached, and contracting blossoms when found. Maintain 
sets S and T analogous to those in Algorithm 3.2.1, with S consisting of u and 
the vertices reached along saturated edges. Reaching an unsaturated vertex 
yields an augmentation. 

Initialization: S = {u} and T = @. 

Iteration: If S has no unmarked vertex, stop; there is no M-augmenting path 
from u. Otherwise, select an unmarked v € S. To explore from v, successively 
consider each y € N(z) such that y ¢ T. 

If y is unsaturated by M, then trace back from y (expanding blossoms as 
needed) to report an M-augmenting u, y-path. 

If y € S, then a blossom has been found. Suspend the exploration of v and 
contract the blossom, replacing its vertices in S and T by a single new vertex 
in S. Continue the search from this vertex in the smaller graph. 

Otherwise, y is matched to some w by M. Include y in T (reached from v), 
and include w in S (reached from y). 

After exploring all such neighbors of v, mark v and iterate. | 


We cannot explore all unsaturated vertices simultaneously as in Algorithm 
3.2.1, because the membership of vertices in blossoms depends on the choice of 
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the initial unsaturated vertex. Nevertheless, if we find no M-augmenting path 
from u, then we can delete u from the graph and ignore it in the subsequent 
search for a maximum matching (Exercise 26). 


3.3.18. Remark. Edmonds’ original algorithm runs in time O(n*). The im- 
plementation in Ahuja—Magnanti—Orlin [1993, p483-—494] runs in time O(n?). 
This requires (1) appropriate data structures to represent the blossoms and 
to process contractions, and (2) careful analysis of the number of contractions 
that can be performed, the time spent exploring edges, and the time spent 
contracting and expanding blossoms. 

The first algorithm solving the maximum matching problem in less than 
cubic time was the O(n*/*) algorithm in Even—Kariv [1975]. The best algo- 
rithm now known runs in time O(n!/m) for a graph with n vertices and m 
edges (this is faster than O(n*/”) for sparse graphs). The algorithm is rather 
complicated and appears in Micali—Vazirani [1980], with a complete proof in 
Vazirani [1994]. 

We have not discussed the weighted matching problem for general graphs. 
Edmonds [1965d] found an algorithm for this, which was implemented in time 
O(n®) by Gabow [1975] and by Lawler [1976]. Faster algorithms appear in 
Gabow [1990] and in Gabow-Tarjan [1989]. | 
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EXERCISES 


3.3.1. (—) Determine whether the graph below has a 1-factor. 


a 
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3.3.2. (—) Exhibit a maximum matching in the graph below, and use a result in this 
section to give a short proof that it has no larger matching. 
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3.3.3. (—) In the graph drawn below, exhibit a k-facter for each k in {0, 1, 2, 3, 4}. 


Ir 
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3.3.4. (—) Let G be a k-regular bipartite graph. Prove that G can be decomposed into 
r-factors if and only if r divides k. 


3.3.5. (—) Given graphs G and H, determine the number of components and maximum 
degree of G Vv H in terms of parameters of G and H. 


3.3.6. (!) Prove that a tree T has a perfect matching if and only if o(T — v) = 1 for every 
v € V(T). (Chungphaisan) 


3.3.7. (!) For each k > 1, construct a k-regular simple graph having no 1-factor. 


3.3.8. Prove that if a graph G decomposes into 1-factors, then G has no cut-vertex. 
Draw a connected 3-regular simple graph that has a 1-factor and has a cut-vertex. 


3.3.9. Prove that every graph G has a matching of size at least n(G)/(1+ A(G)). (Hint: 
Apply induction on e(G).) (Weinstein [1963]) 


3.3.10. (!) For every graph G, prove that B(G) < 2a’(G). For each k € N, construct a 
simple graph G with a’(G) = k and B(G) = 2k. 


3.3.11. Let T be a set of vertices in a graph G. Prove that G has a matching that 
saturates T if and only if for all S C V(G), the number of odd components of G — § 
contained in G[T] is at most |S]. 


3.3.12. (!) Extension of Kénig-Egervdry Theorem to general graphs. Given a graph G, 
let S,,..., 5, and T be subsets of V(G) such that each S; has odd size. These sets form a 
generalized cover of G if every edge of G has one endpoint in T or both endpoints in 
some S;. The weight of a generalized cover is |T| + )_ ||S;| /2]. Let 8*(G) be the min- 
imum weight of a generalized cover. Prove that a’(G) = 6*(G). (Hint: Apply Corollary 
3.3.7. Comment: every vertex cover is a generalized cover, and thus f*(G) < A(G).) 


3.3.13. (+) Tutte’s Theorem from Hall’s Theorem. Let G be a graph such that o(G — S) < 
|S| for all S C V(G). Let T be a maximal vertex subset such that 0(G — T) = |T|. 

a) Prove that every component of G — T is odd, and conclude that T 4 @. 

b) Let C be a component of G — T. Prove that Tutte’s Condition holds for every 
subgraph of C obtained by deleting one vertex. (Hint: Since C — x has even order, a 
violation requires o(C — x — S$) > |S| + 2.) 

c) Let H be a bipartite graph with partite sets T and C,. where C is the set of 
components of G — T. Fort € T and C € C, put ?rC é€ E(A) if and only if Ng(t) contains 
a vertex of C. Prove that H satisfies Hall’s Condition for a matching that saturates C. 

d) Use parts (a), (b), (c), and Hall’s Theorem to prove Tutte’s 1-factor Theorem by 
induction on n(G). (Anderson [1971], Mader [1973]) 
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3.3.14. For k € N, let G be a simple graph such that 6(G) > k and n(G) > 2k. Prove that 
a'(G) > k. (Hint: Apply Corollary 3.3.7.) (Brandt [1994]) 


3.3.15. Let G be a 3-regular graph with at most two cut-edges. Prove that G has a 
1-factor. (Petersen [1891]) 


3.3.16. (!) Let G be a k-regular graph of even order that remains connected when any 
k — 2 edges are deleted. Prove that G has a 1-factor. 


3.3.17. With G as in Exercise 3.3.16, use Remark 3.3.5 to prove that every edge of G 
belongs to some 1-factor. (Comment: This strengthens Exercise 3.3.16.) (Schénberger 
[1934] for k = 3, Berge [1973, p162]) 


3.3.18. (+) For each odd k greater than 1, construct a graph G with no 1-factor that is 
k-regular and remains connected when any k — 3 edges are deleted. (Comment: Thus 
Exercise. 3.3.16 is best possible.) 


3.3.19. (!) Let G be a 3-regular simple graph with no cut-edge. Prove that G decomposes 
into copies of P,. (Hint: Use Theorem 3.3.9.) 


3.3.20. (!) Prove that a 3-regular simple graph has a 1-factor if and only if it decomposes 
into copies of P,. 


3.3.21. (+) Let G be a 2m-regular graph, and let T be a tree with m edges Prove that 
if the diameter of T is less than the girth of G, then G decomposes into copies of T. 
(Hint: Use Theorem 3.3.9 to give an inductive proof of the stronger result that G has 
such a decomposition in which each vertex-is used once as an image of each vertex of T.) 
(Haggkvist) 5 


3.3.22. (!) Let G be an X, Y-bigraph. Let H-be the graph obtained from G by adding one 
vertex to Y ifn(G) is odd and then adding edges to make Y a clique. 

a) Prove that G has a matching of size |X| if and only if H has-a 1-factor. 

b) Prove that if G satisfies Hall’s Condition (|N(S)| > |S| for all S C X), then H 
satisfies Tutte’s Condition (0(H — T) < |T| for all T C V(A)). 

c) Use parts (a) and (b) to obtain Hall’s Theorem from Tutte’s Theorem. 
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3.3.23. Let G be a claw-free connected graph of even order. 

a) Let T be a spanning tree of G generated by Breadth-First Search (Algorithm 
2.3.8). Let x and y be vertices that have a common parent in T other than the root. 
Prove that x and y must be adjacent. 

b) Use part (a) to prove that G has a 1-factor. (Comment: Without part (a), one 
can simply prove the stronger result that the last edge in a longest path belongs to a 
1-factor.) (Sumner [1974a], Las Vergnas [1975]) 


3.3.24. (!) Let G be a simple graph‘of even order n having a set S of size k such that 
o(G — S) > k. Prove that G has at most (5) +k(n —k) + re) edges, and that this 
bound is best possible. Use this to determine the maximum size of a simple n-vertex 
graph with no 1-factor. (Erdés—Gallai [1961]) 


3.3.25. A graph G is factor-critical if each subgraph G — v obtained by deleting one 
vertex has a 1-factor. Prove that G is factor-critical if and only if n(G) is odd and o(G — 
S) < |S| for all nonempty S C V(G). (Gallai [1963a]) 


3.3.26. (!) Let M be a matching in a graph G, and let u be an M-unsaturated vertex. 
Prove that if G has no M-augmenting path that starts at u, then u is unsaturated in 
some maximum matching in G. 
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3.3.27. (*«) Assuming that Algonthm 3.3.17 is correct, we develop an algorithmic proof 
of Tutte’s Theorem (Theorem 3.3.3). 

a) Let G be a graph with no perfect matching, and let M be a maximum matching 
in G. Let S and T be the sets generated when running Algorithm 3.3.17 from u. Prove 
that |7| < |S| < o(G —T). 

b) Use part (a) to prove Theorem 3.3.3. 


3.3.28. («) Let N = NU {0}. Given f: V(G) > No, the graph G is f-soluble if there 
exists w: E(G) — No such that DwekG) w(uv) = f(v) for every v € V(G). 

a) Prove that G has an f-factor if and only if the graph H obtained from G by 
subdividing each edge twice and defining f to be 1 on the new vertices is f-soluble. 
(This reduces testing for an f-factor to testing f-solubility.) 

b) Given G and an f: V(G) — No, construct a graph H (with proof) such that G is 
f-soluble if and only if H has a 1-factor. (Tutte [1954al]) 


3.3.29. («+) Tutte’s f-factor condition and graphic sequences. Given f: V(G) > No, 
define f(S) = )°, 5 f(v) for S C V(G). For are disjoint subsets S$, T of V(G), let q(S, T) 
denote the number of components Q of G — § — T such that e(Q, T) + f(V(Q)) is odd, 
where e(Q, T) is the number of edges from Q to T. Tutte [1952, 1954a] proved that G 
has an f-factor if and only if 


q(S,T)+ f(T) - Lover dg-s(v) < f(S) 


for all choices of disjoint subsets S, T c V. 3 

a) The Parity Lemma. Let 5(S, T) =f (S)- F(T) + Dyer 4g-s(v) — HS, T). Prove 
that 5(S, 7) has the same parity as f(V) for disjoint sets $,T C V(G). (Hint: Use 
induction on |7|.) 

b) Suppose that G = K,, and f(v;) = di, where )/d; is even and d) > --- > dh. 
Use the f-factor condition and part (a) to prove that G has an f-factor if and only if 
pea idi < LF Lerania , 4; for allk,s withk +s <n. 

c) Conclude that d,,...,d, > 0 are the vertex degrees of a simple graph if and only 
if }° d; is even and yy - ee < k(k-—1)+ )0)_,,, min{k, d;} for 1 < k <n. (Erdés—Gallai 
[1960]) 
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Chapter 4 


Connectivity and Paths 


4.1. Cuts and Connectivity 


A good communication network is hard to disrupt. We want the graph 
(or digraph) of possible transmissions to remain connected even when some 
vertices or edges fail. When communication links are expensive, we want to 
achieve these goals with few edges. Loops are irrelevant for connection, so in 
this chapter we assume that our graphs and digraphs have no loops, espe- 
cially when considering degree conditions. 


CONNECTIVITY 
How many vertices must be deleted to disconnect a graph? 


4.1.1. Definition. A separating set or vertex cut of a graph G is a set S © 
V(G) such that G — S$ has more than one component. The connectivity 
of G, written «(G), is the minimum size of a vertex set S such that G — § 
is disconnected or has only one vertex. A graph G is k-connected if its 
connectivity is at least k. 7 


A graph other than a complete graph is k-connected if and only if every 
separating set has size at least k. We can view “k-connected” as a structural 
condition, while “connectivity k” is the solution of an optimization problem. 


4.1.2. Example. Connectivity of K, and Km n. Because a clique has no sepa- 
rating set, we need to adopt a convention for its connectivity. This explains the 
phrase “or has only one vertex” in Definition 4.1.1. We obtain «(K,) =n —1, 
while «(G) < n(G) — 2 when G is not a complete graph. With this convention, 
most general results about connectivity remain valid on complete graphs. 
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Consider a bipartition X,Y of K,,,.. Every induced subgraph that has at 
least one vertex from X and from Y is connected. Hence every separating set of 
Km contains X or Y. Since X and Y themselves are separating sets (or leave 
only one vertex), we have k(Km,) = min{m,n}. The connectivity of K33 is 3; 
the graph is 1-connected, 2-connected, and 3-connected, but not 4-connected. m 


A graph with more than two vertices has connectivity 1 if and only if it is 
connected and has a cutvertex. A graph with more than one vertex has connec- 
tivity 0 if and only if it is disconnected. The 1-vertex graph K, is annoyingly 
inconsistent; it is connected, but for consistency in discussing connectivity we 
set «(K,) = 0. 


4.1.3. Example. The hypercube Q,. For k > 2, the neighbors of one vertex in 
QO, form a separating set, so «(Q,) < k. To prove that «(Q,) = k, we show that 
every vertex cut has size at least k. We use induction on k. 

Basis step: k € {0,1}. For k < 1, Q; is a clique with k + 1 vertices and has 
connectivity k. 

Induction step: k > 2). By the induction hypothesis, «(Q,1) = k — 1. 
Consider the description of Q; as two copies Q and Q’ of Q,;_; plus a matching 
that joins corresponding vertices in Q and Q’ (Example 1.3.8). Let S be a vertex 
cut in Q,. If Q — S is connected and Q’ — § is connected, then Q; — S is also 
connected unless S contains at least one endpoint of every matched pair. This 
requires |S| > 2‘-!, but 2-1 > k fork = 2. 

Hence we may assume that Q — RY is disconnected, which means that S 
has at least k — 1 vertices in Q, by the induction hypothesis. If § contains no 
vertices of Q’, then Q’ — Sis connected-and all vertices of Q — S have neighbors 
in Q’ — S, so Q, — S is connected. Hence S$ must also contain a vertex of Q’. 
This yields |S| > k, as desired. | 


Deleting the neighbors of a vertex disconnects a graph (or leaves only one 
vertex), so «(G) < 6(G). Equality need not hold; 2K,, has minimum degree 
m — 1 but connectivity 0. 

Since connectivity k requires 5(G) > k, it also requires at least [kn/2] edges. 
The k-dimensional cube achieves this bound, but only for the case n = 2‘. The 
bound is best possible whenever k < n, as shown by the next example. 


4.1.4. Example. Harary graphs. Given k < n, place n vertices around a circle, 
equally spaced. If k is even, form H,,, by making each vertex adjacent to the 
nearest k/2 vertices in each direction around the circle. If k is odd and n is 
even, form H,, by making each vertex adjacent to the nearest (k — 1)/2 vertices 
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in each direction and to the diametrically opposite vertex. In each case, H;.,, is 
k-regular. 

When k and n are both odd, index the vertices by the integers modulo n. 
Construct H;,, from H;,_1,, by adding the edges i — i + (n — 1)/2 for 0 <i < 
(n —1)/2. The graphs H4, H5,3, and Hs 9 appear below. a 


4.1.5. Theorem. (Harary [1962a]) «(H;,,) = k, and hence the minimum num- 
ber of edges in a k-connected graph on n vertices is [kn/2]. 


Proof: We prove only the even case k = 2r, leaving the odd case as Exercise 
12. Let G = H,,,. Since 5(G) = k, it suffices to prove x(G) > k. For S C V(G) 
with |S| < k, we prove that G — S is connected. Consider u,v € V(G) — S. The 
original circular arrangement has a clockwise u, v-path and a counterclockwise 
u, v-path along the circle; let A and B be the sets of internal vertices on these 
two paths. E 

Since |S| < k, the pigeonhole principle implies that in one of {A, B}, S 
has fewer than k/2 vertices. Since in G each vertex has edges to the next k/2 
vertices in a particular direction, deleting fewer than k/2 consecutive vertices 
cannot block travel in that direction. Thus we can find a uv, v-path in G — S via 
the set A or B in which S has fewer than k/2 vertices. a 


me/ 


Harary’s construction determines the degree conditions that allow a graph 
to be k-connected. Exercise 22 determines the degree conditions that force a 
simple graph to be k-connected. Since it depends on vertex deletions, connec- 
tivity is not affected by deleting extra copies of multiple edges. Hence we state 
degree conditions for k-connectedness only in the context of simple graphs. 


4.1.6. Remark. A direct proof of «(G) > k considers a vertex cut S and proves 
that |5| > k, or it considers a set S with fewer than k vertices and proves that 
G — S is connected. The indirect approach assumes a cut of size less than k and 
obtains a contradiction. The indirect proof may be easier to find, but the direct 
proof may be clearer to state. 

Note also that if k < n(G) and G has a vertex cut of size less than k, then G 
has a vertex cut of size k — 1 (first delete the cut, then continue deleting vertices 
until k — 1 are gone, retaining a vertex in each of two components). 

Finally, proving x(G) = k also requires presenting a vertex cut of size k; 
this is usually the easy part. | 
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EDGE-CONNECTIVITY 


Perhaps our transmitters are secure and never fail, but our communication 
links are subject to noise or other disruptions. In this situation, we want to 
make it hard to disconnect our graph by deleting edges. 


4.1.7. Definition. A disconnecting set of edges is a set F C E(G) such that 
G — F has more than one component. A graph is k-edge-connected if ev- 
ery disconnecting set has at least k edges. The edge-connectivity of G, 
written «’(G), is the minimum size of a disconnecting set (equivalently, the 
maximum k such that G is k-edge-connected). 
Given S, T C V(G), we write [S, T] for the set of edges having one end- 
point in S and the other in 7. An edge cut is an edge set of the form [S, S], 
where S is a nonempty proper subset of V(G) and S denotes V(G) — S. 


cent | 
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disconnecting set edge cut 
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4.1.8. Remark. Disconnecting set vs. edge cut. Every edge cut is a disconnect- 
ing set, since G < [S, S] has no path from S to S. The converse is false, since 
a disconnecting set can have extra edges. Above we show a disconnecting set 
and an edge cut in bold; see also Exercise 13. 

Nevertheless, every minimal disconnecting set of edges is an edge cut (when 
n(G) > 1). If G — F has more than one component for some F C E(G), then 
for some component H of G — F we have deleted all edges with exactly one 
endpoint in H. Hence F contains the edge cut [V(H), V(H)], and F is nota 
minimal disconnecting set unless F = [V(H), V(H)]. a 


The notation for edge-connectivity continues our convention of using a 
“prime” for an edge parameter analogous to a vertex parameter. Using the 
same base letter emphasizes the analogy and avoids the confusion of using 
many different letters — and running out of them. 

Deleting one endpoint of each edge in an edge cut F deletes every edge of F. 
This suggests that «(G) < x'(G). However, we must be careful not to delete the 
only vertex of a component of G — F and thereby leave a connected subgraph. 


4.1.9. Theorem. (Whitney [1932a]) If G is a simple graph, then 
K(G) Sk'(G) < 6(G). 


Proof: The edges incident to a vertex v of minimum degree form an edge cut; 
hence «’(G) < 6(G). It remains to show that «(G) < x’(G). 
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We have observed that «(G) < n(G) — 1 (see Example 4.1.2). Consider a 
smallest edge cut [S, S]. If every vertex of S is adjacent to every vertex of S, 
then |[S, S]] = |S||S| > n(G) — 1 > «(G), and the desired inequality holds. 

Otherwise, we choose x € S and y € S with x # y. Let T consist of all 
neighbors of x in S and all vertices of § — {x} with neighbors in S. Every x, y- 
path passes through T, so T is a. separating set. Also, picking the edges from x 
to TMS and one edge from each vertex of TM S to S (shown bold below) yields 
|T| distinct edges of [S, S]. Thus «’(G) = |[S, S]| = |7| = «(G). r 
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We have seen that «(G) = 5(G) when G is a complete graph, a biclique, a 
hypercube, or a Harary graph. By Theorem 4.1.9, also «’(G) = 5(G) for these 
graphs. Nevertheless, in many graphs the set of edges incident to a vertex of 
minimum degree is not a minimum edge cut. The situation «’(G) < 5(G) is 
precisely the situation where no minimum edge cut isolates a vertex. 
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4.1.10. Example. Possibility of « < x’ < 5. For graph G below, «(G) = 1, 
x'(G) = 2, and 6(G) = 8. Note that no minimum edge cut isolates a vertex. 
Each inequality can be arbitarily ‘weak. When G = Km + Km, we have 
K(G) = k'(G) = 0 but 5(G) = m — 1. When G consists of two m-cliques sharing 
a single vertex, we have «’/(G) = 6(G) =m — 1 but «(G) = 1. | 


Various conditions force equalities among the parameters; for example, 
x'(G) = 6(G) when G has diameter 2 (Exercise 25). For 3-regular graphs, 
connectivity and edge-connectivity are always equal. 


4.1.11. Theorem. If G is a 3-regular graph, then «(G) = k’(G). 


Proof: Let 5 be a minimum vertex cut (|5S| = «(G)). Since «(G) < x’(G) always, 
we need only provide an edge cut of size |S|. Let H1, Hz be two components of 
G —S. Since S is a minimum vertex cut, each v € S has a neighbor in H,; and 
a neighbor in Ho. Since G is 3-regular, v cannot have two neighbors in H; and 
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two in Ho. For each v € S, delete the edge from v to a member of {H,, Hj} where 
v has only one neighbor. 

These «(G) edges break all paths from H; to Ho except in the case below, 
where a path can enter S via v; and leave via v2. In this case we delete the edge 
to H, for both v, and ve to break all paths from H to H2 through {v, v9}. a 


v2 

v1 
When «(G) < 6(G), a minimum edge cut cannot isolate a vertex. In fact, 
whenever |[S, S]| < 5(G), the set S (and also S) must be much larger than a 


single vertex. This follows from a simple relationship between the size of the 
edge cut [S, $] and the size of the subgraph induced by S. 


4.1.12. Proposition. If S is a set of vertices in a graph G, then 
|[S, SI] = [0-5 d(v)] — 2e(GIs]). 


) 
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Proof: An edge in G[S] contributes twice to >; d(v), while an edge in [S, 5] 
contributes only once to the sum. Since this counts all contributions, we obtain 
Doves A(v) = |[s, S]| + 2e(G[S]). a 
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4.1.13. Corollary. If G is a simple graph and ILS , 5] | < 5(G) for some nonempty 
proper subset S of V(G), then |S| > 5(G). 


Proof: By Proposition 4.1.12, we have 5(G) > >> 
d(v) => 6(G) and 2e(G[S]) < [S| (|S| — 1) yields 


6(G) > |S|5(G) — |S] JS] — 1). 


d(v) — 2e(G[S]). Using 


veS 


This inequality requires |5| > 1, so we can combine the terms involving 5(G) 
and cancel |S| — 1 to obtain |S| > 5(G). | 


As a set of edges, an edge cut may contain another edge cut. For example, 
Ki. has three edge cuts, but one of them contains the other two. The minimal 
non-empty edge cuts of a graph have useful structural properties. 


4.1.14. Definition. A bond is a minimal nonempty edge cut. 


Here “minimal” means that no proper nonempty subset ‘is also an edge cut. 
We characterize bonds in connected graphs. 


4.1.15. Proposition. If G is a connected graph, then an edge cut F is a bond if 
and only if G — F has exactly two components. 
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Proof: Let F = [S, S] be an edge cut. Suppose first that G — F has exactly 
two components, and let F’ be a proper subset of F. The graph G — F’ contains 
the two components of G — F plus at least one edge between them, making it 
connected. Hence F is a minimal disconnecting set and is a bond. 

For the converse, suppose that G — F has more than two components. Since 
G — F is the disjoint union of G[S] and G[,, one of these has at least two com- 
ponents. Assume by symmetry that it is G[S]. We can thus write S = AUB, 
where no edges join A and B. Now the edge cuts [A, A] and [B, B] are proper 
subsets of F, so F is not a bond. a 


BLOCKS 


A connected graph with no cut- ertex need not be 2-connected, since it 


can be K; or Ky. Connected subgraphs without cut-vertices provide a useful 
decomposition of a graph. = 


mm 


4.1.16. Definition. A block ofa graph G is a maximal connected subgraph of 
G that has no cut-vertex. If G itself is connected and has no cut-vertex, 
then G is a block. 


4.1.17. Example. Blocks. If H is a block of G, then H as a graph has no cut- 
vertex, but H may contain vertices that are cut-vertices of G. For example, 
the graph drawn below has five blocks; three copies of Kz, one of K3, and one 
subgraph that is neither a cycle nor a clique. | 


<> 
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4.1.18. Remark. Properties of blocks. An edge of a cycle cannot itself be a block, 
since it is in a larger subgraph with no cut-vertex. Hence an edge is a block if 
and only if it is a cut-edge; the blocks of a tree are its edges. If a block has more 
than two vertices, then it is 2-connected. The blocks of a loopless graph are its 
isolated vertices, its cut-edges, and its maximal 2-connected subgraphs. a 
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4.1.19. Proposition. Two blocks in a graph share at most one vertex. 


Proof: We use contradiction. Suppose that blocks B;, Bz have at least two 
common vertices. We show that B; U Bz is a connected subgraph with no cut- 
vertex, which contradicts the maximality of B; and Bo. 

When we delete one vertex from B;, what remains is connected. Hence 
we retain a path in B; from every vertex that remains to every vertex of 
V(B1) M V(B2) that remains. Since the blocks have at least two common ver- 
tices, deleting a single vertex leaves a vertex in the intersection. We retain 
paths from all vertices to that vertex, so B, U Bz cannot be disconnected by 
deleting one vertex. a 


Every edge by itself is a subgraph with no cut-vertex and hence is in a block. 
We conclude that the blocks of a graph decompose the graph. Blocks in a graph 
behave somewhat like strong components of a digraph (Definition 1.4.12), but 
strong components share no vertices (Exercise 1.4.13a). Thus although blocks 
in a graph decompose the edge set, strong components in a digraph merely 
partition the vertex set and usually omit edges. 

When two blocks of G share a vertex, it must be a cut-vertex of G. The 
interaction between blocks and cut-vertices is described by a special graph. 


ks 


4.1.20.* Definition. The block-cutpoint graph of a graph G is a bipartite 
graph H in which one partite set consists of the cut-vertices of G, and the 
other has a vertex b; for each block B; of G. We include vb; az an edge of H 
if and only if v € B;. 


n 
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When G is connected, its block-cutpoint graph is a tree (Exercise 34) whose 
leaves are blocks of G. Thus a graph G that is not a single block has at least 
two blocks (leaf blocks) that each contain exactly one cut-vertex of G. 

Blocks can be found using a technique for searching graphs. In Depth- 
First Search (DFS), we explore always from the most recently discovered 
vertex that has unexplored edges (also called backtracking). In contrast, 
Breadth-First Search (Algorithm 2.3.8) explores from the oldest vertex, so the 
difference between DFS and BFS is that in DFS we maintain the list of vertices 
to be searched as a Last-In First-Out “stack” rather than a queue. 


4.1.21.* Example. Depth-First Search. In the graph below, one depth-first 
’ search from u finds the vertices in the order u,a,b,c,d,e, f,g. For both BFS 
and DFS, the order of discovery depends on the order of exploring edges from a 
searched vertex. a 
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A breath-first or depth-first search from u generates a tree rooted at u; 
each time exploring a vertex x yields a new vertex v, we include the edge xv. 
This grows a tree that becomes a spanning tree of the component containing 
u. Applications of depth-first search rely on a fundamental property of the 
resulting spanning tree. 


4.1.22.* Lemma. If T is a spanning tree of a connected graph G grown by DFS 
from u, then every edge of G not in T consists of two vertices v, w such that 
v lies on the u, w-path in T. 


Proof: Let vw be an edge of G, with v encountered before w in the depth-first 
search. Because vw is an edge, we cannot finish v before w is added to T. Hence 
w appears somewhere in the subtree formed before finishing v, and the path 
from w to u contains v. | 
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4.1.23.* Algorithm. (Computing the locks of a graph) 
Inpui: A connected graph G. (The blocks of a graph are the blocks of its com- 
ponents, which can be found by depth -first search, so we may assume that G is 
connected.) 
Idea: Build a depth-first search tree T of G, discarding portions of T as blocks 
are identified. Maintain one vertex called ACTIVE. 
Initialization: Pick a root x ¢ V(H); make x ACTIVE; set T = {x}. 
Iteration: Let v denote the current active vertex. 
1) If v has an unexplored incident edge vw, then 
1A) If w ¢ V(T), then add vw to T, mark vw explored, make w ACTIVE. 
1B) If w € V(T), then w is an ancestor of v; mark vw explored.. 
2) If v has no more unexplored incident edges, then 
2A) If v 4 x, and w is the parent of v, make w ACTIVE. If no vertex in 
the current subtree T’ rooted at v has an explored edge to an ancestor above w, 
then V(T’) U {w} is the vertex set of a biocs; record this information and delete 
V(T’) from T. 
2B) If v = x, terminate. a 


4,1.24.* Example. Finding blocks. For the graph below, cne depth-first traver- 
sal from x visits the other vertices in the order a, b, c,d, e, f, g,h,i, j. We find 
blocks in the order {a, b,c, d}, {e, f.g,h}, {a, i}, {x,a,e}, {x, j}. After finding 
each block, we delete the vertices other than the highest. Exercise 36 requests 
a proof of correctness. s 
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EXERCISES 


4.1.1. (—) Give a proof or a counterexample for each statement below. 
a) Every graph with connectivity 4 is 2-connected. 
b) Every 3-connected graph has connectivity 3. 
c) Every k-connected graph is k-edge-connected. 
d) Every k-edge-connected graph is k-connected. 


4.1.2. (—) Give a counterexample to the following statement, add a hypothesis to correct 
it, and prove the corrected statement: Ife is a cut-edge of G, then at least one vertex of 
e is a cut-vertex of G. 


4.1.3. (—) Let G be an n-vertex simple graph other than K,. Prove that if G is not 
k-connected, then G has a separating set of size k-1. 


4.1.4. (—) Prove that a graph G is k-connected if and only if G v K, (Definition 3.3.6) is 
k + r-connected. 


4.1.5. (—) Let G be a connected eek with at least three vertices. Form G’ from G by 
adding an edge with endpoints x, y whenever dg(x, y) = 2. Prove that G’ is 2-connected. 


4.1.6. (—) For a graph G with blocks By, ..., By, prove that n(G) = ()-,_,1(Bi)) —k +1. 


4.1.7. (—) Obtain a formula for the number of spanning trees of a connected graph in 
terms of the numbers of spanning trees of its blocks. 


4.1.8. Determine x(G), «’(G), and 6(G) for each graph G drawn below. 


4.1.9. For each choice of integers k,/, m with 0 < k <1 < m, construct a simple graph G 
with «(G) =k, x’(G) =1, and 6(G) = m. (Chartrand—Harary [1968]) 


4.1.10. (!) Find (with proof) the smallest 3-regular simple graph having connectivity 1. 
4.1.11. Prove that «’(G) = «(G) when G is a simple graph with A(G) < 3. 
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4.1.12. Let n,k be positive integers with n even, k odd, andn > k > 1. Let G be the k- 
regular simple graph formed by placing n vertices on a circle and making each vertex 
adjacent to the opposite vertex and to the (k — 1)/2 nearest vertices in each direction. 
Prove that «(G) = k. (Harary [1962a]) 


4.1.13. In K,,.,, let S consist of a vertices from one partite set and b from the other. 

a) Compute Iky S]| in terms of a,b, m,n. 

b) Use part (a) to prove numerically that «’(K,,.,) = min{m, n}. 

c) Prove that every set of seven edges in K33 is a disconnecting set, but no set of 
seven edges is an edge cut. 


4.1.14. (!) Let G be a connected graph in which for every edge e, there are cycles C; and 
C2 containing e whose only common edge is e. Prove that G is 3-edge-connected. Use 
this to show that the Petersen graph is 3-edge-connected. 


4.1.15. (!) Use Proposition 4.1.12 and Theorem 4.1.11 to prove that the Petersen graph 
is 3-connected. 


4.1.16. Use Proposition 4.1.12 to prove that the Petersen graph has an edge cut of size 
m if and only if 3 < m < 12. (Hint: Consider |[S, S]| for 1 < |S| < 5.) 


4.1.17. Prove that deleting an edge cut of size 3 in the Petersen graph isolates a vertex. 


4.1.18. Let G be a triangle-free graph with minimum degree. at least 3. Prove that if 
n(G) < 11, then G is 3-edge-connected. Show that this inequality is sharp by finding a 
3-regular bipartite graph with 12 vertices that is not 3-edge-connected. (Galvin) 


4.1.19. Prove that «(G) = 4(G) if G is simple and 8(G) > n(G) — 2. Prove that this 
is best possible for each n > 4 by constructing a simple n-vertex graph with minimum 
degree n — 3 and connectivity less than n —3. 


4.1.20. (!) Let G be a simple n-vertex graph with n/2 —1 < 6(G) < n—2. Prove that 
G is k-connected for all k with k < 25(G) +2 —n. Prove that this is best possible for all 
6 => n/2—1 by constructing a simple n-vertex graph with minimum degree 6 that is not 
k-connected for k = 25+ 3—n. (Comment: Proposition 1.3.15 is the special case of this 
when 6(G) = (n — 1)/2.) 


4.1.21. (+) Let G be a simple n-vertex graph with n > k +/ and 6(G) > eae Prove 
that if G — S has more than / components, then |S| > k. Prove that the hypothesis on 
5(G) is best possible for n > k +1 by constructing an appropriate n-vertex graph with 
minimum degree | eee |. (Comment: This generalizes Exercise 4.1.20.) 


4.1.22. (!) Sufficient condition for k + 1-connected graphs. (Bondy [1969]) 

a) Let G be a simple n-vertex graph with vertex degrees d, < --- < d,. Prove that 
if dj > j +k whenever j <n —1-—d,-,, then G is k + 1-connected. (Comment: Exercise 
1.3.64 is the special case of this when k = 0.) 

b) Suppose that 0 < j +k <n. Construct an n-vertex graph G such that «(G) < k 
and G has j vertices of degree j +k — 1, has n— j —k vertices of degree n — j —1, and has 
k vertices of degree n — 1. In what sense does this show that part (a) is best possible? 


4,1.23. (!) Let G be an r-connected graph of even order having no Kj,,,; as an induced 
subgraph. Prove that G has a 1-factor. (Sumner [1974b]) 


4.1.24. (!) Degree conditions for x’ = 5. Let G be a simple n-vertex graph. Use Corollary 
4.1.13 to prove the following statements. 

a) If 5(G) > |n/2], then «'(G) = 6(G). Prove this best possible by constructing for 
each n > 3 a simple n-vertex graph with 6(G) = [n/2]-— 1 and x’(G) < 8(G). 
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b) If d(x) + d(y) => n— 1 whenever x # y, then x’(G) = 5(G). Prove that this is 
best possible by constructing for each n > 4 and 5(G) =m < n/2 — 1 an n-vertex graph 
G with k’(G) < 6(G) = m in which d(x) + d(y) > n — 2 whenever x # y. 


4.1.25. (!) «’(G) = 4(G) for diameter 2. Let G be a simple graph with diameter 2, and 
let [S, S] be a minimum edge cut with |S| < |S]. _ 

a) Prove that every vertex of S has a neighbor in S. 

b) Use part (a) and Corollary 4.1.18 to prove that «’(G) = 6(G). (Plesnik [1975]) 


4.1.26. (!) Let F be a set of edges in G. Prove that F is an edge cut if and only if F 
contains an even number of edges from every cycle in G. For example, when G = C,, 
every even subset of the edges is an edge cut, but no odd subset is an edge cut. (Hint: 
For sufficiency, the task is to show that the components of G — F can be grouped into 
two nonempty collections so that every edge of F has an endpoint in each collection.) 


4.1.27. (!) Let [S, S] be an edge cut. Prove that there is a set of pairwise edge-disjoint 
bonds whose union (as edge sets) is [S, S]. (Note: This is trivial if [S, 5] is itself a bond.) 
4.1.28. (!) Prove that the symmetric difference of two different edge cuts is an edge cut. 
(Hint: Draw a picture illustrating the two edge cuts and use it to guide the proof.) 
4.1.29. (!) Let H be a spanning subgraph of a connected graph G. Prove that H isa 
spanning tree if and only if the subgraph H* = G — E(H) is a maximal subgraph that 
contains no bond. (Comment: See Section 8.2 for a more gerieral context.) 

4.1.30. (—) Let G be the simple graph with-vertex set {1,..., 11} defined by i < j ifand 
only if i, j have a common factor bigger than 1. Determine the blocks of G. 

4.1.31. A cactus is a connected graph in which every block is an edge or a cycle. Prove 
that the maximum number of edges in a simple n-vertex cactus is |3(n — 1)/2]. (Hint: 
lxJ+ ly] <let+yl.) : 


a 
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4.1.32. Prove that every vertex of a graph has even degree if and only if every block is 
Eulerian. 


4.1.33. Prove that a connected graph is k-edge-connected if and only if each of its blocks 
is k-edge-connected. 


4.1.34. (!) The block-cutpoint graph (see Definition 4.1.20). Let H be the block-cutpoint 
graph of a graph G that has a cut-vertex. (Harary—Prins [1966]) 

a) Prove that H is a forest. 

b) Prove that G has at least two blocks each of which contains exactly one cut- 
vertex of G. 

c) Prove that a graph G with k components has exactly k + )),-yg(b(v) — 1) blocks, 
where b(v) is the number of blocks containing v. 

d) Prove that every graph has fewer cut-vertices than blocks. 


4.1.35. Let H and H’ be two maximal k-connected subgraphs of a graph G. Prove that 
they have at most k — 1 common vertices. (Harary-Kodama [1964]) 


4.1.36. Prove that Algorithm 4.1.23 correctly computes blocks of graphs. 


4.1.37. Develop an algorithm to compute the strong components of a digraph. Prove 
that it works. (Hint: Model the algorithm on Algorithm 4.1.23). 
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4.2. k-Connected Graphs 


A communication network is fault-tolerant if it has alternative paths 
between vertices: the more disjoint paths, the better. In this section, we 
prove that this alternative measure of connection is essentially the same as 
k-connectedness. When k = 1, the definition already states that a graph G is 1- 
connected if and only if each pair of vertices is connected by a path. For larger 
k the equivalence is more subtle. 


2-CONNECTED GRAPHS 


We begin by characterizing 2-connected graphs. 


4.2.1. Definition. Two paths from u to v are internally disjoint if they have 
no common internal vertex. 


4.2.2. Theorem. (Whitney [1932a]) A graph G having at least three vertices 
is 2-connected if and only if for each pair u, v € V(G) there exist internally 
disjoint u, v-paths in G. 
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Proof: Sufficiency. When G has internally disjoint u, v-paths, deletion of one 
vertex cannot separate u from v. Since this condition is given for every pair 
u, v, deletion of one vertex cannot make any vertex unreachable from any other. 
We conclude that G is 2-connected. 

Necessity. Suppose that G is 2-connected. We prove by induction on d(u, v) 
that G has internally disjoint u, v-paths. 

Basis step (d(u, v) = 1). When d(u, v) = 1, the graph G — uv is connected, 
since «'(G) > «(G) > 2. Au, v-path in G — uv is internally disjoint in G from 
the u, v-path formed by the edge uv itself. 

Induction step (d(u, v) > 1). Let k = d(u. v). Let w be the vertex before v 
on a shortest u, v-path; we have d(u, w) = k — 1. By the induction hypothesis, 
G has internally disjoint uv, w-paths P and Q. If v e V(P) U V(Q), then we find 
the desired paths in the cycle P U Q. Suppose not. 

Since G is 2-connected, G — w is connected and contains a u, v-path R. If R 
avoids P or Q, we are done, but R may share internal vertices with both P and 
Q. Let z be the last vertex of R (before v) belonging to P U Q. By symmetry, we 
may assume that z € P. We combine the u, z-subpath of P with the z, v-subpath 
of R to obtain a u, v-path internally disjoint from Q U wv. a 
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4.2.3. Lemma. (Expansion Lemma) If G is a k-connected graph, and G’ is 
obtained from G by adding a new vertex y with at least k neighbors in G, 
then G’ is k-connected. 


Proof: We prove that a separating set S of G’ must have size at least k. If y € S, 
then S — {y} separates G, so [S| > k +1. If y ¢ S and N(y) C S, then |S| > k. 
Otherwise, y and N(y) — S lie in a single component of G’ — S. Thus again S$ 
must separate G and |S| > k. | 


Ce DS» 
ee 


4.2.4. Theorem. For a graph G with at least three vertices, the following con- 
ditions are equivalent (and characterize 2-connected graphs). 
A) G is connected and has no cut-vertex. 
B) For all x, y € V(G), there are internally disjoint x, y-paths. 
C) For all x, y € V(G), there is a cycle through x and y. 
D) 5(G) > 1, and every pair of edges in G lies on a common cycle. 


Proof: Theorem 4.2.2 proves AB. 

For BsC, note that cycles eon taunand x and y correspond to pairs of inter- 
nally disjoint x, y-paths. 

For D=C, the condition 5(G) > i implies that vertices x and y are not 
isolated; we then apply the last part of D to edges incident to x and y. If there 
is only one such edge, then we use it and any edge incident to a third vertex. 

To complete the proof, we assume that G satisfies the equivalent properties 
A and C and then derive D. Since G is connected, 5(G) > 1. Now consider two 
edges uv and xy. Add to G the vertices w with neighborhood {u, v} and z with 
neighborhood {x, y}. Since G is 2-connected, the Expansion Lemma (Lemma 
4.2.3) implies that the resulting graph G’ is 2-connected. 

Hence condition C holds in G’, so w and z lie on a cycle C in G’. Since w, z 
each have degree 2, C must contain the paths u, w, v and x, z, y but not the - 
edges uv or xy. Replacing the paths u, w, v and x, z, y in C with the edges uv 


and xy yields the desired cycle through uv and xy in G. | 
u x 
w 2 
Uv y 


4.2.5. Definition. In a graph G, subdivision of an edge wv is the operation of 
replacing uv with a path u, w, v through a new vertex w. 


CD > GED 
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4.2.6. Corollary. If G is 2-connected, then the graph G’ obtained by subdivid- 
ing an edge of G is 2-connected. 


Proof: Let G’ be formed from G by adding vertex w to subdivide uv. To show 
that G’ is 2-connected, it suffices to find a cycle through arbitrary edges e, f of 
G' (by Theorem 4.2.4D). 

Since G is 2-connected, any two edges of G lie on a common cycle (Theorem 
4.2.4D). When our given edges e, f of G’ lie in G, a cycle through them in G is 
also in G’, unless it uses uv, in which case we modify the cycle. Here “modify 
the cycle” means “replace the edge uv with the u, v-path of length 2 through w". 

When e € E£(G) and f &€ {uw, wv}, we modify a cycle passing through e and 
uv in G. When {e, f} = {uw, wv}, we modify a cycle through uv. a 


The class of 2-connected graphs has a characterization that expresses the 
construction of each such graph from a cycle and paths. 


4.2.7. Definition. An ear of a graph G is a maximal path whose internal ver- 
tices have degree 2 in G. An ear decomposition of G is a decomposition 
Pie eicck P, such that Po is a cycle and P; fori > 1 is an ear of Po U---UP;. 


; 


~ telegraag.n 


4.2.8. Theorem. (Whitney [1932a]) A graph is 2-connected if and only if it has 
an ear decomposition. Furthermore, every cycle in a 2-connected graph is 
the initial cycle in some ear decomposition. 


Proof: Sufficiency. Since cycles are 2-connected, it suffices to show that adding 
an ear preserves 2-connectedness. Let u,v be the endpoints of an ear P to be 
added to a 2-connected graph G. Adding an edge cannot reduce connectivity, 
so G + uv is 2-connected. A succession of edge subdivisions converts G + uv 
into the graph G U P in which P is an ear; by Corollary 4.2.6, each subdivision 
preserves 2-connectedness. 

Necessity. Given a 2-connected graph G, we build an ear decomposition of G 
from a cycle C in G. Let Go = C. Let G; be a subgraph obtained by successively 
adding i ears. If G; # G, then we can choose an edge uv of G — E(G;) and an 
edge xy € E(G;). Because G is 2-connected, uv and xy lie on a common cycle 
C’. Let P be the path in C’ that contains uv and exactly two vertices of G;, one 
at each end of P. Now P can be added to G; to obtain a larger subgraph G;+1 
in which P is an ear. The process ends only by absorbing all of G. a 
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Every 2-connected graph is 2-edge-connected, but the converse does not 
hold. Recall that the bowtie is the graph consisting of two triangles sharing 
one common vertex; it is 2-edge-connected but not 2-connected. Since more 
graphs are 2-edge-connected, decomposition of 2-edge-connected graphs needs 
a more general operation. The proof is like that of Theorem 4.2.8. 


4.2.9. Definition. A closed ear in a graph G is a cycle C such that all vertices 
of C except one have degree 2 in G. A closed-ear decomposition of G is 
a decomposition Po,..., P, such that Po is a cycle and P; fori > 1 is either 
an (open) ear or a closed ear in G. 


4.2.10. Theorem. A graph is 2-edge-connected if and only if it has a closed-ear 
decomposition, and every cycle in a 2-edge-connected graph is the initial 
cycle in some such decomposition 


Proof: Sufficiency. Cut-edges are the edges not on cycles (Theorem 1.2.14), so 
a connected graph is 2-edge-connected if and only if évery edge lies on a cycle. 
The initial cycle is 2-edge-connected. When we add a closed ear, its edges form 
a cycle. When we add an open ear P to a connected graph G, a path in G 
connecting the endpoints of P completes a cycle containing all edges of P. In 
each case, the new graph also is connected. Thus adding an open or closed ear 
preserves 2-edge-connectedness. @ 

Necessity. Given a 2-edge- connected graph G, let Po be a cycle in G. Con- 
sider a closed-ear decomposition Po, ..., P; of asubgraph G; of G. When G; #4 G, 
we find an ear to add. Since G is connected. there is an edge uv € E(G) — E(G;) 
with u € V(G;). Since G is 2-edge-connected, uv lies on acycle C. Follow C un- 
til it returns to V(G;), forming up to this point a path or cycle P. Adding P 
to G; yields a larger subgraph G;+; in which P is an open or closed ear. ane 
process ends only by absorbing all of G. 


CONNECTIVITY OF DIGRAPHS 


Our results about k-connected and k-cdge-connected graphs will apply as 
well for digraphs, where we use analogous terminology. 


4.2.11. Definition. A separating set or vertex cut of a digraph Disaset S C 
V(D) such that D — S is not strongly connected. A digraph is k-connected 
if every vertex cut has at least k vertices. The minimum size of a vertex 
cut is the connectivity «(D). 

For vertex sets S, T in a digraph D, let [S, T] denote the set of edges 
with tail in'S and head in T. An edge cut is the set [S, S] for some @ 4 
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S c V(D). A digraph is k-edge-connected if every edge cut has at least k 
edges. The minimum size of an edge cut is the edge-connectivity «’(D). 


4.2.12. Remark. Because | [S, S]| is the number of edges leaving S, we can 
restate the definition of edge-connectivity as follows: A graph or digraph G is k- 
edge-connected if and only if for every nonempty proper vertex subset S, there 
are at least k edges in G leaving S. 

Note that [S, T] is the set of edge from S to T. The meaning of this depends 
on whether we are discussing a graph or a digraph. In a graph, we take all 
edges that have endpoints in both sets. In a digraph, we take only the edges 
with tail in S and head in T. s 


Strong digraphs are similar to 2-edge-connected graphs. 


4.2.13. Proposition. Adding a (directed) ear to a strong digraph produces a 
larger strong digraph. 


Proof: By Remark 4.2.12, a digraph is strong if and only if for every nonempty 
vertex subset there is a departing edge. If we add an open ear or closed ear P 
to a strong digraph D, then for every set S with @ c S C V(D) we already have 
an edge from § to V(D) — S. We need only consider sets that don’t iritersect 
V(D) and sets that contain all of V(D)-but not all of V(P). For every such set, 
there is an edge leaving it along P. | 


m.me/mat 


‘nen can the streets in a road network all be made one-way without mak- 
ing any location unreachable from some other location? In other words, when 
does a graph have a strong orientation? The graph below does not. The obvious 
necessary conditions are sufficient. 


4.2.14. Theorem. (Robbins [1939]) A graph has a strong orientation if and 
only if it is 2-edge-connected. 


Proof: Necessity. If a graph G is disconnected, then some vertices cannot reach 
others in any orientation. If G has a cut-edgeé xy oriented from x to y in an 
orientation D, then y cannot reach x in D. Hence G must be connected and 
have no cut-edge. 

Sufficiency. When G is 2-edge-connected, it has a closed-ear decomposition. 
We orient the initial cycle consistently to.obtain a strong digraph. As we add 
each new ear and direct it consistently, Proposition 4.2.13 guarantees that we 
still have a strong digraph. a 
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Robbins’ Theorem generalizes for all k. When G has a k-edge-connected 
orientation, Remark 4.2.12 implies that G must be 2k-edge-connected. Nash- 
Williams [1960] proved that this obvious necessary condition is also sufficient: a 
graph has a k-edge-connected orientation if and only if it is 2k-edge-connected. 
This is easy when G is Eulerian (Exercise 21), but the general case is diffi- 
cult (see Exercises 36—38). A thorough discussion of this and other orientation 
theorems appears in Frank [1993]. . 


k-CONNECTED AND k-EDGE-CONNECTED GRAPHS 


We have introduced two measures of good connection: invulnerability to 
deletions and multiplicity of alternative paths. Extending Whitney’s Theorem, 
we show that these two notions are the same, for both vertex deletions and edge 
deletions, and for both graphs and digraphs. 

We first discuss the “local” problem of x, y-paths for a fixed pair x, y € V(G). 
These definitions hold both for graphs and for digraphs. 


4.2.15. Definition. Given x, y € V(G), a set S C V(G) — {x, y} is an x, y- 
separator or x, y-cutifG—S has nox, y-path. Let x(x, y) be the minimum 
size of an x, y-cut. Let A(x, y) be the maximum size of a set of pairwise inter- 
nally disjoint x, y-paths. For X, YC V(G), an X, Y-path is a path having 
first vertex in X, last vertex in Y, and no other vertex in X UY. 


An x, y-cut must contain an internal vertex of every x, y-path, and no vertex 
can cut two internally disjoint x, y-paths. Therefore, always x(x, y) > A(x, y). 
Thus the problems of finding the smallest cut and the largest set of paths are 
dual problems, like the duality between matching and covering in Chapter 3. 


4.2.16. Example. In the graph G below, the set S = (b,c, z,d} is an x, y-cut of 
size 4; thus x(x, y) < 4. As shown on the left, G has four pairwise internally 
disjoint x, y-paths; thus A(x, y) > 4. Since x(x, y) > A(x, y) always, we have 
K(x, y) =A(x, y) = 4. 


a b y 


Consider also the pair w, z. As shown on the right, «(w, z) = A(w, z) = 8, 
with {b, c, x} being a minimum w, z-cut. The graph G is 3-connected; for every 
pair u, v € V(G), we can find three pairwise internally disjoint u, v-paths. 

From the equality for internally disjoint paths, we will obtain an analogous 
equality for edge-disjoint paths. Although x (w, z) = 3 above, it takes four edges 
to break all w, z-paths, and there are four pairwise edge-disjoint w, z-paths. @ 
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What we call Menger’s Theorem states that the local equality x(x, y) = 
A(x, y) always holds. The global statement for connectivity and analogous re- 
sults for edge-connectivity and digraphs were observed by others. All are con- 
sidered forms of Menger’s Theorem. More than 15 proofs of Menger’s Theorem 
have been published, some yielding stronger results, some incorrect. (A gap in 
Menger’s original argument was later repaired by K6nig.) 


4.2.17. Theorem. (Menger [1927]) If x, y are vertices of a graph G and xy ¢ 
E(G), then the minimum size of an x, y-cut equals the maximum number 
of pairwise internally disjoint x, y-paths. 


Proof: An x, y-cut must contain an internal vertex from each path in a set 
of pairwise internally disjoint x, y-paths. These vertices must be distinct, so 
K(x, y) = A(x, y). 

To prove equality, we use induction on n(G). Basis step: n(G) = 2. Here 
xy ¢ E(G) yields x(x, y) = A(x, y) = 0. Induction step: n(G) > 2. Let k = 
kc(x, y). We construct k pairwise internally disjoint x, y-paths. Note that since 
N(x) and N(y) are x, y-cuts, no minimum cut properly contains N(x) or N(y). 

Case 1: G has a minimum x, y-cut S other than N(x) or N(y). To obtain 
the k desired paths, we combine x, S-paths and S, y-paths obtained from the 
induction hypothesis (as formed by solid edges shown below). Let Vi be the 
set of vertices on x, S-paths, and let Vz be the set of vertices on S, y-paths. We 
claim that S = V; N V2. Since S$ is a minimal x, y-cut, every vertex of S lies on 
an x, y-path, and hence S C V,M Vo. If v € (Vi N Ve), then following the x, v- 
portion of some x, S-path and then the-v, y-portion of some S, y-path yields an 
x, y-path that avoids the x, y-cut S. This is impossible, so S = V(G,) N V(Go). 
By the same argument, V; omits N(y) — S and V2 omits N (x) — 8. 

Form H; by adding to G[Vi] a vertex y’ with edges from'S. Form Hp by 
adding to G[V2] a vertex x’ with edges to S$. Every x, y-path in G starts with an 
x, S-path (contained in H}), so every x, y’-cutin H; is an x, y-cutin G. Therefore, 
Ky, (x, y’) =k, and similarly «x, (x’, y) = k. 

Since V; omits N(y) — S and V2 omits N(x) — S, both H; and Hp» are smaller 
than G. Hence the induction hypothesis yields A, (x, y’) =k =Ax,(x’, y). Since 
Vi Vo = S, deleting y’ from the k paths in H, and x’ from the k paths in 
Hz yields the desired x, S-paths and S, y-paths in G that combine to form k 
pairwise internally disjoint x, y-paths in G. 


Vi_fes—e V2 _\ 
RN RB Sy DOaN . ee 
x SL / x hy 7 
G Ay Ho 


Case 1 Case 2 
Case 2. Every minimum x, y-cut is N(x) or N(y). Again we construct the 
k desired paths. In this case, every vertex outside {x U N(x) U N(y) U y} is 
in no minimum x, y-cut. If G has such a vertex v, then xg_,(x, y) = k, and 
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applying the induction hypothesis to G — v yields the desired x, y-paths in G. 
Also, if there exists u € N(x)MN(y), then u appears in every x, y-cut; and 
KG—u(x, y) =k —1. Now applying the induction hypothesis to G — v yields k — 1 
paths to combine with the path x, v, y. 

We may thus assume that N(x) and N(y) partition V(G) — (x, y}. Let G’ 
be the bipartite graph with bipartition N(x), N(y) and edge set [N(x), N(y)]. 
Every x, y-path in G uses some edge from N(x) to N(y), so the x, y-cuts in G 
are precisely the vertex covers of G’. Hence £(G’) = k. By the Konig—Egervary 
Theorem, G’ has a matching of size k. These k edges yield k pairwise internally 
disjoint x, y-paths of length 3. | 


Case 2 is needed in the proof because when S$ = N(x), the induction hy- 
pothesis cannot be used to obtain the S, y-paths. 

The statement of Theorem 4.2.17 makes sense also for digraphs The proof 
of the digraph version is exactly the same; we only need to replace N(x) and 
N(y) with Nt(x) and N~(y) throughout. 

We next develop the analogue of Theorem 4.2.17 for edge-disjoint paths, 
which we prove by applying Theorem 4.2.17 to a transformed graph. The main 
part of the transformation is an operation that we will use again in Chapter 7. 


4.2.18. Definition. The line graph of a graph G, written L(G), is the graph 
whose vertices are the edges of G; with ef € E(L(G)) when e = uv and 
f = vw in G. Substituting “digraph” for “graph” in this sentence yields 
the definition of line digraph. For graphs, e and f share a vertex; for 
digraphs, the head of e must be the tail of f. 


Sf, 
L 
£ h e h 
g 
8 
G L(G) L(A) 


When disconnecting y from x by deleting edges, we use notation analogous 
to that of Definition 4.2.15: 4’(x, y) is the maximum size of a set of pairwise 
edge-disjoint x, y-paths, and «'(x, y) is the minimum number of edges whose 
deletion makes y unreachable from x. Elias—Feinstein—Shannon [1956] and 
Ford—Fulkerson [1956] proved that always A’(x, y) = x’(x, y) (using the meth- 
ods of Section 4.3). We allow multiple edges and allow xy € E(G). 


4.2.19. Theorem. If x and y are distinct vertices of a graph or digraph G, 
then the minimum size of an x, y- -disconnecting set of edges equals the 
maximum number of pairwise edge-disjoint x, y-paths. 


Proof: Modify G to obtain G’ by adding two new vertices s, t and two new edges 
sx and yt. This does not change x’(x, y) or A’(x, y), and we can think of each 
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path as starting from the edge sx and ending with the edge yr. A set of edges 
disconnects y from x in G if and only if the corresponding vertices of L(G’) form 
an sx, yt-cut. Similarly, edge-disjoint x, y-paths in G become internally disjoint 
sx, yt-paths in L(G’), and vice versa. Since x # y, we have no edge from sx to 
ty in L(G’). Applying Theorem 4.2.17 to L(G’) yields 


KG(x, ¥) = K1L(G) (SX, Yt) = ALE (SX, Yt) = AG(X, y). a 


The global version for k-connected graphs, observed first by Whitney 
[1932a], is also commonly called Menger’s Theorem. The global versions for 
edges and digraphs appeared in Ford—Fulkerson [1956]. 


4.2.20. Lemma. Deletion of an edge reduces connectivity by at most 1. 


Proof: We discuss only graphs; the argument for digraphs is similar (Exercise 
7). Since every separating set of G is a separating set of G — xy, we have 
x(G — xy) < x(G). Equality holds unless G — xy has a separating set S that 
is smaller than «(G) and hence is not a separating set of G. Since G — S is 
connected, G — xy — S has two components G[X] and G[Y], with x € X and 
y € Y. InG —S, the only edge joining X and Y is xy. 

If |X| > 2, then SU {x} is a separating set of G, and «(G)-< k(G—xy)+1. If 
|Y| > 2, then again the inequality holds. In the remaining case, |S| = n(G) — 2. 
Since we have assumed that [S| < «(G), |S| = n(G) — 2 implies that «(G) > 
n(G) — 1, which holds only for a complete graph. Thus «(G — xy) =n(G) —-2= 
x(G) — 1, as desired. | 


4.2.21. Theorem. The connectivity of G equals the maximum & such that 
A(x, y) => k for all x, y € V(G). The edge-connectivity of G equals the max- 
imum k such that A’(x, y) > k for all x, y € V(G). Both statements hold for 
graphs and for digraphs. 


Proof: Since «'(G) = min,y,<y(g) « (x, y), Theorem 4.2.19 immediately yields 
the claim for edge-connectivity. 

For connectivity, we have x(x, y) = A(x, y) for xy ¢ E(G), and x(G) is the 
minimum of these values. We need only show that A(x, y) cannot be smaller 
than «(G) when xy € E(G). Certainly deletion of xy reduces i(x, y) by 1, since 
xy itself is an x, y-path and cannot contribute to any other x, y-path. With this, 
Theorem 4.2.17, and Lemma 4.2.20, we have 


Ag(x, y) = 1+ AG-xy(@, y) = 1+ KkG-2z(%, y) = 1+K«(G— xy) = K(G). a 
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APPLICATIONS OF MENGER’S THEOREM 


Dirac extended Menger’s Theorem to other families of paths. 


4.2.22. Definition. Given a vertex x and a set U of vertices, an x, U-fan is a 
set of paths from x to U such that any two of them share only the vertex x. 


4.2.23. Theorem. (Fan Lemma, Dirac [1960]). A graph is k-connected if and 
only if it has at least k+ 1 vertices and, for every choice of x, U with |U| > k, 
it has an x, U-fan of size k. 


Proof: Necessity. Given k-connected graph G, we construct G’ from G by adding 
a new vertex y adjacent to all of U. The Expansion Lemma (Lemma 4.2.3) im- 
plies that G’ also is k-connected, and then Menger’s Theorem yields k pairwise 
internally disjoint x, y-paths in G’. Deleting y from these paths produces an 
x, U-fan of size k in G. 

Sufficiency. Suppose that G satisfies the fan condition. For v « V(G) and 
U = V(G) — {v}, there is a v, U-fan of size k; thus 6(G) > k. Given w, z € V(G), 
let U = N(z). Since |U| > k, we have an w, U-fan of size k; extend each path 
by adding an edge to z. We obtain k pairwise internally disjoint w, z-paths, so 
A(w, z) > k. This holds for all w, z € V(G), so G is k-connected. a 


The Fan Lemma generalizes considerably. Whenever X and ¥ are disjoint 
sets of vertices in a k-connected graph G and we specify integers at X and Y 
summing to k in each set, there are k pairwise internally disjoint X, Y-paths 
with the specified number ending at each point (Exercise 28). The Fan Lemma 
also yields the next result. 


4,2.24.* Theorem. (Dirac [1960]) If G is a k-connected graph (with k > 2), and 
S is a set of k vertices in G, then G has a cycle including S in its vertex set. 


Proof: We use induction on k. Basis step (k = 2): Theorem 4.2.2 (or Theorem 
4.2.21) implies that any two vertices are connected by two internally disjoint 
paths, which form a cycle containing them. 

Induction step (k > 2): With G and S as specified, choose x € S. Since G is 
also k — 1-connected, the induction hypothesis implies that all of S — {x} lies on 
a cycle C. Suppose first that n(C) = k — 1. Since G is k — 1-connected, we have 
an x, V(C)-fan of size k — 1, and the paths of the fan to two consecutive vertices 
of C enlarge the cycle to include x. 

Hence we may assume that n(C) > k. Since G is k-connected, G has an 
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x, V(C)-fan of size k. We claim that again the fan has two paths forming a 
detour from C that includes x while keeping S — {x}. Let v1,..., u_1 be the 
vertices of § — {x} in order on C, and let V; be the portion of V(C) from v; up to 
but not including v;.; (here uy, = v}). 

The sets Vj,..., Vi_1 partition V(C) into k — 1 disjoint sets. Since the 
x, V(C)-fan has k paths, two of them enter V(C) in one of these sets, by the 
pigeonhole principle. Let u, u’ be the vertices where these paths reach C. Re- 
placing the u, u’-portion of C by the x, u-path and x, u’-path in the fan builds a 


new cycle that contains x and all of S — {x}. | 
u’ 
C G 
k=4 
u 
U3 U9 


Many applications of Menger’s Theorem involve modeling a problem so 
that the desired objects correspond to paths in a graph or digraph, often by 
graph transformation arguments. For example, given sets A = Aj,..., Am with 
union X, a system of distinct representatives (SDR) is a set of distinct ele- 
ments x1,..., Xm such that x; € A;. A necessary and sufficient condition for the 
existence of an SDR is that |LU;-, Ai| > {/| for all J ¢ [m]. It is easy to prove this 
from Hall’s Theorem by modeling A with an appropriate bipartite graph (Ex- 
ercise 3.1.19). Indeed, Hall’s Theorem was originally proved in the language of 
SDRs and is equivalent to Menger’s Theorem (Exercise 23). 

Ford and Fulkerson considered a more difficult problem. Let A = 
A1,...,Am and B = By,...,B, be two families of sets. We may ask when 
there is a common system of distinct representatives (CSDR), meaning a 
set of m elements that is both an SDR for A and an SDR for B. They found a 
necessary and sufficient condition. 


4,.2.25.* Theorem. (Ford—Fulkerson {[1958]) Families A = {A,...,A,,} and 


B = {B,,..., B,} have acommon system of distinct representatives (CSDR) 
if and only if 
IU Ai) N (UL B))| > |1| + |J| —m for each pair J, J C [m]. 
iel jeJ 


Proof: We create a digraph G with vertices qj,...,dm and b1,...,bm, plus a 
vertex for each element in the sets and special vertices s, t. The edges are 


{sa;: A; € A} {ajx: xe€E A;} 
{bjt: B; € B} {xb;: xeE B;} 
Each s, t-path selects a member of the intersection of some A; and some B;. 


There is a CSDR if and only if there is a set of m pairwise internally disjoint 
s,t-paths. By Menger’s Theorem, it suffices to show that the stated condition 
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is equivalent to having no s, t-cut of size less than m. Given a set R C V(G) — 
{s,t}, let J = {a;} — R and J = {bj} — R. The set R is ans, t-cut if and only if 
(Ue, Ai) (jes B;) C R. For an s, t-cut R, we thus have 


IR = | n(_JB)| + (om = 11) + (m= ID. 
iel jeJ 


This lower bound is at least m for every s, t-cut if and only if the stated condition 
holds. | 


4,2.26.* Example. Digraph for CSDR.In the example above, the elements are 
(1, 2,3, 4}, A = (12, 23, 31}, and B = {14, 24, 1234}. Suppose that RM {a;} = 
{a1, a2} and RM {b;} = (by, bg}. In the arguiment, we set J = {a3} and J = {b3}, 
and we observe that R is an s, t-cut if and only if it also contains {1, 3}, which 
equals (ier Aj) A (Ujes Bj). 7 


EXERCISES 


4.2.1. (—) Determine x (u, v) and «’(u, v) in the graph drawn below. (Hint: Use the dual 
problems-to give short proofs of optimality.) 


4.2.2. (—) Prove that if G is 2-edge-connected and G’ is obtained from G by subdividing 
an edge of G, then G’ is 2-edge-connected. Use this to prove that every graph having a 
closed-ear decomposition is 2-edge-connected. (Comment: This is an alternative proof 
of sufficiency for Theorem 4.2.10.) 
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4.2.3. (—) Let G be the digraph with vertex set [12] in which i —> j if and only if i 
divides j. Determine «(1, 12) and «’(1, 12). 


4.2.4, (—) Prove or disprove: If P is a u, v-path in a 2-connected graph G, then there is 
au, v-path Q that is internally disjoint from P. 


4.2.5. (—) Let G be a simple graph, and let H(G) be the graph with vertex set V(G) 
such that uv € E(A) if and only if u, v appear on a common cycle in G. Characterize the 
graphs G such that A is a clique. 


4.2.6. (—) Use results of this section to prove that a simple graph G is 2-connected if and 
only if G can be obtained from C3 by a sequence of edge additions and edge subdivisions. 


4.2.7. Let xy be an edge in a digraph G. Prove that k(G — xy) > k(G) — 1. 


4.2.8. Prove that a simple graph G is 2-connected if and only if for every triple (x, y, z) 
of distinct vertices, G has an x, z-path through y. (Chein [1968]) 


4.2.9. Prove that a graph G with at least four vertices is 2-connected if and only if for 
every pair X,Y of disjoint vertex subsets with |X|,|Y| => 2, there exist two completely 
disjoint paths P,, Ps in G such that each has an endpoint in X and an endpoint in Y and 
no internal vertex in X or Y. 


4.2.10. A greedy ear decomposition of a 2-connected graph is an ear decomposition 
that begins with a longest cycle and iteratively adds a longest ear from the remaining 
graph. Use a greedy ear decomposition to prove that every 2-connected claw-free graph 
G has {n(G)/3] pairwise-disjoint copies of Ps. (Kaneko—Kelmans—Nishimura [2000]) 


4.2.11. (!) For a connected graph G with atJeast three vertices, prove that the following 
statements are equivalent (use of Menger’ oe is permitted). 

A) G is 2-edge-connected. 

B) Every edge of G appears in a cycle. 

C) G has a closed trail containing any specified pair of edges. 

D) G has a closed trail containing any specified pair of vertices. 


4.2.12. (!) Use Menger’s Theorem to prove that «(G) = «'(G) when G is 3-regular (The- 
orem 4.1.11). 


4.2.13. (!) Let G be a 2-edge-connected graph. Define a relation R on E(G) by (e, f) E R 
ife = f or if G — e — f is disconnected. (Lovasz [1979, p277]) 

a) Prove that (e, f) € Rif and only if e, f belong to the same cycles. 

b) Prove that R is an equivalence relation on E(G). 

c) For each equivalence class F, prove that F is contained in a cycle. 

d) For each equivalence class F, prove that G — F has no cut-edge. 


4.2.14. (!) Au, v-necklace is a list of cycles Ci, ..., C, such that u € Ci, v € C;, con- 
secutive cycles share one vertex, and nonconsecutive cycles are disjoint. Use induction 
on d(u, v) to prove that a graph G is 2-edge-connected if and only if for all u, v € V(G) 
there is a u, v-necklace in G. 


>-S0: 
4.2.15. (+) Let v be a vertex of a 2-connected graph G. Prove that v has a neighbor u 
such that G — u — v is connected. (Chartrand—Lesniak [1986, p51]) 
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4.2.16. (+) Let G be a 2-connected graph. Prove that if 7;, Tz are two spanning trees 
of G, then 7; can be transformed into 7, by a sequence of operations in which a leaf is 
removed and reattached using another edge of G. 


4.2.17. Determine the smallest graph with connectivity 3 having a pair of nonadjacent 
vertices linked by four pairwise internally disjoint paths. 


4.2.18, Let G be a graph without isolated vertices. Prove that if G has no even cycles, 
then everv block of G is an edge or an odd cycle. 


4.2.19. (!) Membership in common cycles. 

a) Prove that two distinct edges lie in the same block of a graph if and only if they 
belong to a common cycle. 

b) Given e, f,g € E(G), suppose that G has a cycle through e and f and a cycle 
through f and g. Prove that G also has a cycle through e and g. (Comment: This 
problem implies that for graphs without cut-edges, “belong to a common cycle” is an 
equivalence relation whose equivalence classes are the edge sets of blocks.) 


4.2.20. Prove that the hypercube Q, is k-connected by constructing k pairwise internally 
disjoint x, y-paths for each vertex pair x, y € V(Q,). 


4.2.21. (!) Let G be a 2k-edge-connected graph with at most two vertices of odd degree. 
Prove that G has a k-edge-connected orientation. (Nash-Williams [1960]) 


4.2.22. (!) Suppose that «(G) = k and diam G =d. Prove that n(G) > k(d —1)+2 and 
a(G) > [(1+d)/2]. For each k > 1 and d > 2, construct a grdph for which equality 
holds in both bounds. 


4.2.23. (!) Use Menger’s Theorem (x(x, y) = = A(x, y) when xy ¢ E(G)) to prove the 
Konig—Egervary Theorem (a’(G) = £(G) when G is bipartite). 


21e/math 


4.2.24, (!) Let G be a k-connected graph, and let S,T be disjoint subsets of V(G) with 
size at least k. Prove that G has k pairwise disjoint S, T-paths. 


4.2.25. (*) Show that Theorem 4.2.24 is best possible by constructing for each k a k- 
connected graph having k + 1 vertices that do not lie on a cycle. 


4.2.26. For k > 2, prove that a graph with at least k + 1 vertices is k-connected if and 
only if for every T C S € V(G) with |S| = k and |T| = 2, there is a cycle in G that 
contains T and avoids S — T. (Lick [1973]) 


4.2.27. A vertex k-split of a graph G is a graph H obtained from G by replacing one 
vertex x € V(G) by two adjacent vertices x;, x2 such that dy(x;) > k and that Ny (x1) U 
Nu (x2) = Ne (x) U (x1, x2}. 

a) Prove that every vertex k-split of a k-connected graph is k-connected. 

b) Conclude that any graph obtained from a “wheel” W, = K; V C,_1 (Definition 
3.3.6) by a sequence of edge additions and vertex 3-splits on vertices of degree at least 
4 is 3-connected. (Comment: Tutte [1961b] proved also that every 3-connected graph 
arises in this way. The characterization does not extend easily for k > 3.) 
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4.2.28. (!) Let X and Y be disjoint sets of vertices in a k-connected graph G. Let u(x) for 
x € X and w(y) for y € Y be nonnegative integers such that }°,., u(x) = >) -y w(y) =k. 
Prove that G has k pairwise internally disjoint X, Y-paths so that u(x) of them start at 
x and w(y) of them end at y, for x € X and ye Y. 


4.2.29. Given a graph G, let D be the digraph obtained by replacing each edge with two 
oppositely-directed edges having the same endpoints (thus D is the symmetric digraph 
with underlying graph G). Assume that for all x, y ¢ V(D) both «,(x, y) = A(x, y) and 
Kp(x, y) = Ap(x, y) hold, the latter applying only when x 4 y. Use this hypothesis to 
prove that also «¢(x, y) = AG (x, y) and kg(x, y) =Ac(x, y), the latter for x # y. 


4.2.30. (!) Prove that applying the expansion operation of Example 1.3.26 to a 3- 
connected graph yields a 3-connected graph. Obtain the Petersen graph from K, by 
expansions. (Comment: Tutte [1966a] proved that a 3-regular graph is 3-connected if 
and only if it arises from K4 by a sequence of these operations.) 


4.2.31. Let G be a k-connected simple graph. 

a) Let C and D be two cycles in G of maximum length. For k = 2 and k = 8, prove 
that C and D share at least k vertices. (Hint: If they don’t, construct a longer cycle.) 

b) For each k > 2, construct a k-connected graph that has distinct longest cycles 
with only k common vertices. (Hint: Ke,4 works for k = 2.) 


4.2.32. Graph splices. Let G,; and G2 be disjoint k-connected graphs with k > 2. Choose 
v; € V(G;) and vg € V(G2). Let B be a bipartite graph with partite sets Ng, (v1) and 
No, (v2) that has no isolated vertex and has a matching of size at least k. Prove that 
(G, — v1) U (G2 — v2) U Bis k-connected. = 


4.2.33. («) Prove Hall’s Theorem from Theorem 4.2.25. 


ath 


1 


4.2.34, A k-connected graph G is minimally k-connected if for every e ¢ E(G), the 
graph G — ¢ is not k-connected. Halin [1969] proved that 5(G) = k when G is minimally 
k-connected. Use ear decomposition to prove this for k = 2. Conclude that a minimally 
2-connected graph G with at least 4 vertices has at most 2n(G) — 4 edges, with equality 
only for K2,,-2. (Dirac [1967]) 


4.2.35. Prove that if G is 2-connected, then G — xy is 2-connected if and only if x and y 
lie on a cycle in G — xy. Conclude that a 2-connected graph is minimally 2-connected if 
and only if every cycle is an induced subgraph. (Dirac [1967], Plummer [1968]) 


4.2.36. (!) For 5 C V(G), let d(S) = |{s, S]]. -Let X and Y be nonempty proper vertex 
subsets of G. Prove that d(X NY)+d(X UY) < d(X) + d(Y). (Hint: Draw a picture and 
consider contributions from various types of edges.) 


4.2.37. (+) A k-edge-connected graph G is minimally k-edge-connected if for every 
e € E(G) the graph G — e is not k-edge-connected. Prove that 6(G) = k when G is 
minimally k-edge-connected. (Hint: Consider a minimal set S such that |S, S]| =k. If 


|S| # 1, use G — e for some e € E(G[S]) to obtain another set 7 with |(7, T]| = k such 
that S, T contradict Exercise 4.2.36.) (Mader [1971]; see also Lovasz [1979, p285]) 


4.2.38. Mader [1978] proved the following: “If z is a vertex of a graph G such that 
dg(z) ¢ {0, 1, 3} and z is incident to no cut-edge, then z has neighbors x and y such that 
KG—xz-yz+xy(U, V) = Kg(u, v) for all u, v € V(G) — {z}.” Use Mader’s Theorem and Exercise 
4,2.37 to prove Nash-Williams’ Orientation Theorem: every 2k-edge-connected graph 
has a k-edge-connected orientation. (Comment: A weaker version of Mader’s Theorem, 
given in Lovasz [1979, p286—288], also yields Nash-Williams’ Theorem in the same way.) 
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4.3. Network Flow Problems 


Consider a network of pipes where valves allow flow in only one direction. 
Each pipe has a capacity per unit time. We model this with a vertex fur each 
junction and a (directed) edge for each pipe, weighted by the capacity. We also 
assume that flow cannot accumulate at a junction. Given two locations s, t in 
the network, we may ask “what is the maximum flow (fer unit time) from s to t?” 

This question arises in many contexts. The network may represent roads 
with traffic capacities, or links in a computer network with data transmission 
capacities, or currents in an electrical network. There are applications in in- 
dustrial settings and to combinatorial min-max theorems. The seminal book on 
the subject is Ford—Fulkerson [1962]. More recently, Anuja-Magnanti—Orlin 
[1993] presents a thorough treatment of network flow problems. 


4.3.1. Definition. A network is a digraph wiin a nonnegative capacity c(e) 
on each edge e and a distinguished source vertex s and sink vertex f. 
Vertices are also called nodes. A flow f assigns a value f(e) to each elge 
e. We write f*+(v) for the total flow on edges leaving v and f~(v) for the 
total flow on edges entering v. A flow is feasible if it satisfies the ca- 
pacity constraints 0 < f(e) < ce) for each edge and the conservation 
constraints f+(v) = f~(v) for each node v ¢ {s, t}. 


o 


MAXIMUM NETWORK FLOW 


We consider first the problem of maximizing the net flow into the sink. 


4.3.2. Definition. The value val(/) of a flow f is the net flow f~(t) — ft(t) 
into the sink. A maximum flow is a feasible flow of maximum value. 


4.3.3. Example. The zero flow assigns flow 0 to each edge; this is feasible. 
In the network below we illustrate a nonzero feasible flow. Each capacities are 
shown in bold, flow values in parentheses. Our flow f assigns f(sx) = f(vt) = 
0, and f(e) = 1 for every other edge e. This is a feasible flow of value 1. 


A path from the source to the sink with excess capacity would allow us to 
increase flow. In this example, no path remains with excess capacity, but the 
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flow f’ with f’(vx) = 0 and f’(e) = 1 for e # vx has value 2. The flow f is 
“maximal” in that no other feasible flow can be found by increasing the flow on 
some edges, but f is not a maximum flow. 

We need a more general way to increase flow. In addition to traveling 
forward along edges with excess capacity, we allow traveling backward (against 
the arrow) along edges where the flow is nonzero. In this example, we can 
travel from ¥ to x to v tot. Increasing the flow by 1 on sx and vt and decreasing 
it by one on vx changes f into f’. | 


4.3.4. Definition. When f is a feasible flow in a network N, an f-augmenting 
path is a source-to-sink path P in the underlying graph G such that for 
each e € E(P), 

a) if P follows e in the forward direction, then f(e) < c(e). 

b) if P follows e in the backward direction, then f(e) > 0. 
Let €(e) = c(e) — f(e) when e is forward on P, and let €(e) = f(e) whene is 
backward on P. The tolerance of P is min,¢z,p) €(e). 


As in Example 4.3.3, an f-augmenting path leads to a flow with larger 
value. The definition of f-augmenting path ensures that the tolerance is posi- 
tive; this amount is the increase in the“flow value. 


) 


be 


4.3.5. Lemma. If P is an f -augmenting path with tolerance z, then changing 
flow by +z on edges followed forward by P and by —z on edges followed 
backward by P produces a feasible flow f’ with val( f’) = val(f) + z. 


Proof: The definition of tolerance ensures that 0 < f’(e) < c(e) for every edge 
e, so the capacity constraints hold. For the conservation constraints we need 
only check vertices of P, since flow elsewhere has not changed. 

The edges of P incident to an internal vertex v of P occur in one of the four 
ways shown below. In each case, the change to the flow out of v is the same as 
the change to the flow into v, so the net flow out of v remains 0 in /’. 


Finally, the net flow into the sink t increases by z. | 
+ UF + Do = 
— vost = ) = 


The flow on backward edges did not disappear; it was redirected. In ef- 
fect, the augmentation in Example 4.3.3 cuts the flow path and extends each 
portion to become a new flow path. We will soon describe an algorithm to find 
augmenting paths. 

Meanwhile, we would like a quick way to know when our present flow is a 
maximum flow. In Example'4.3.3, the central edges seem to form a “bottleneck”; 
we only have capacity 2 from the left half of the network to the right half. This 
observation will give us a PROOF that the flow value can be no larger. 
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4.3.6. Definition. In a network, a source/sink cut [S, T] consists of the edges 
from a source set S to a sink set 7, where S and T partition the set 
of nodes, with s € S andt € T. The capacity of the cut [S, T], written 
cap(S, 7), is the total of the capacities on the edges of [S, T]. 


ro 


Keep in mind that in a digraph [S, T] denotes the set of edges with tail in 

S and head in T. Thus the capacity of a cut [S, T] is completely unaffected by 

edges from T to S. 
Given a cut [S, 7], every s, t-path uses at least one edge of [S, T], so intu- 

ition suggests that the value of a feasible flow should be bounded by cap(S, T). 

To make this precise, we extend the notion of net flow to sets of nodes. Let 

f+t(U) denote the total flow on edges leaving U, and let f~(U) be the total flow 

on edges entering U. The net flow out of U is then ft(U) — f~(U). 

4.3.7. Lemma. If U is a set of nodes in a network, then the net flow out of U 
is the sum of the net flows out ofthe nodes in U. In particular, if f is a 
feasible flow and [S, T] is a source/sink cut, then the net flow out of S and 
net flow into T equal val(/). E 


Proof: The stated claim is that 
ft) - f-U) = DV eylft@) -— f-@)I. 


We consider the contribution of the flow f(xy) on an edge xy to each side of 
the formula. If x, y € U, then f(xy) is not counted on the left, but it contributes 
positively (via ft(x)) and negatively (via f~(y)) on the right. If x, y ¢ U, then 
f (xy) contributes to neither sum. Ifxy e€ [U, U], then it contributes positively to 
each sum. If xy € [U, U], then it contributes negatively to each sum. Summing 
over all edges yields the equality. 

When [S, 7] is a source/sink cut and f is a feasible flow, net flow from nodes 
of S sums to f*(s) — f~(s), and net flow from nodes of T sums to ft(t) — f~(t), 
which equals —val(f). Hence the net flow across any source/sink cut equals 
both the net flow out of s and the net flow into ¢. s 
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4.3.8. Corollary. (Weak duality) If f is a feasible flow and [S, T] is a source/sink 
cut, then val(f) < cap(S, T). 


Proof: By the lemma, the value of f equals the net flow out of S. Thus 
val(f) = f*(S) — f-(S) s f7(S), 


since the flow into S is no less than 0. Since the capacity constraints require 
f*(S) < cap(S, T), we obtain val(f) < cap(S, T). | 
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Among source/sink cuts, one with minimum capacity yields the best bound 
on the value of a flow. This defines the minimum cut problem. The max flow 
and min cut problems on a network are dual optimization problems.! Given a 
flow with value a and a cut with value a, the duality inequality in Corollary 
4.3.8 PROVES that the cut is a minimum cut and the flow is a maximum flow. 

If every instance has solutions with the same value to both the max prob- 
lem and the min problem (“strong duality”), then a short proof of optimality 
always exists. This does not hold for all dual pairs of problems (recall matching 
and covering in general graphs), but it holds for max flow and min cut. 

The Ford—Fulkerson algorithm seeks an augmenting path to increase the 
flow value. If it does not find such a path, then it finds a cut with the same 
value (capacity) as this flow; by Corollary 4.3.8, both are optimal. If no infi- 
nite sequence of augmentations is possible, then the iteration leads to equality 
between the maximum flow value and the minimum cut capacity. 


4.3.9. Algorithm. (Ford—Fulkerson labeling algorithm) 
Input: A feasible flow f in a network. 
Output: An f-augmenting path or a cut with capacity val(/). 
Idea: Find the nodes reachable from s by paths with positive tolerance. Reach- 
ing t completes an f-augmenting path. During the search, R is the set of nodes 
labeled Reached, and S is the subset of R labeled Searched. 
Initialization: R = {s}, 5 = @. 
Iteration: Choose v € R—S. 
For each exiting edge vw with f(vw) < c(vw) and w ¢ R, add w to R. 
For each entering edge uv with f (uv > O)andu ¢ R, add u to R. 
Label each vertex added to R as “reached”, and record v as the vertex reaching 
it. After exploring all edges at v, add v to S. 
If the sink t has been reached (put in R), then trace the’path reaching ft 
to report an f-augmenting path and terminate. If R = S, then return the cut 
[S, S] and terminate. Otherwise, iterate. | 
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4.3.10. Example. On the left below is the network of Example 4.3.3 with the 
flow f. We run the labeling algorithm. First we search from s and find excess 
capacity to uv and x, labeling them reached. Now we haveu,v € R—S. There 
is no excess capacity on uv or xy, so searching from u reaches nothing, and also 


(1)1 


x (11 y 


TThe precise notion of “dual problem” comes from linear programming. For our pur- 
poses, dual problems are a maximization problem and a minimization problem such 
that a < b whenever a and b are the values of feasible solutions to the max problem and 
min problem, respectively. See Section 8.1 for further discussion. 
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searching from x does not reach y. However, there is nonzero flow on vx. Thus 
we label v from x. Now v is the only element of R — S, and searching from v 
reaches t. We labeled t from v, v from x, and x from s, so we have found the 
augmenting path s, x, v, ¢. 

The tolerance on this path is 1, so the augmentation increases the flow 
value by 1. In the new flow f’ shown on the right, every edge has unit flow 
except f’(vx) = 0. When we run the labeling algorithm again, we have excess 
capacity on su and sx and can label {u, x}, but from these nodes we can label no 
others. We terminate with R = S = {s,u,x}. The capacity of the resulting cut 
[S, S] is 2, which equals val(f’) and proves that f’ is a maximum flow. | 


Repeated use of the labeling algorithm allows us to solve the maximum 
flow problem and prove the strong duality relationship. 


4.3.11. Theorem. (Max-flow Min-cut Theorem—Ford and Fulkerson [1956]) In 
every network, the maximum value of a feasible flow equals the minimum 
capacity of a source/sink cut. 


Proof: In the max-flow problem, the zero flow (f(e) = 0 for all e) is always a 
feasible flow and gives us a place to start. Given a feasible flow, we apply the 
labeling algorithm. It iteratively adds-vertices to S (each vertex at most once) 
and terminates with ¢ € R (“breakthrough”) or with S = R. 

In the breakthrough case, we have-an f-augmenting path and increase the 
flow value. We then repeat the labeling algorithm. When the capacities are 
rational, each augmentation increases the flow by a multiple of 1/a, where a 
is the least common multiple of the denominators, so after finitely many aug- 
mentations the capacity of some cut is reached. The labeling algorithm then 
terminates with S = R. 

When terminating this way, we claim that [S, 7] is a source/sink cut with 
capacity val(f), where T = S and f is the present flow. It is a cut because s € S 
and ¢t ¢ R= S. Since applying the labeling algorithm to the flow f introduces 
no node of T into R, no edge from S to T has excess capacity, and no edge from 
T to S has nonzero flow in f. Hence f*(S) = cap(S, 7) and f~(S) =0. 

Since the net flow out of any set containing the/source but not the sink is 
val(f), we have proved 


val(f) = f*(S) — f-(S) = f*(S) =cap(S,T). B 


This proof of Theorem 4.3.11 requires rational capacities; otherwise, Algo- 
rithm 4.3.9 may yield augmenting paths forever! Ford and Fulkerson provided 
an example of this with only ten vertices (see Papadimitriou—Steiglitz [1982, 
p126-128]). Edmonds and Karp [1972] modified the labeling algorithm to use 
at most (n? — n)/4 augmentations in an n-vertex network and work for all real 
capacities. As in the bipartite matching problem (Theorem 3.2.22), this is done 
by searching always for shortest augmenting paths. Faster algorithms are now 
known; again we cite Ahuja—Maynanti—Orlin [1993] for a thorough discussion. 
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INTEGRAL FLOWS 


In combinatorial applications, we typically have integer capacities and 
want a solution in which the flow on each edge is an integer. 


4.3.12. Corollary. (Integrality Theorem) If all capacities in a network are in- 
tegers, then there is a maximum flow assigning integral flow to each edge. 
Furthermore, some maximum flow can be partitioned into flows of unit 
value along paths from source to sink. 


Proof: In the labeling algorithm of Ford and Fulkerson, the change in flow 
value when an augmenting path is found is always a flow value or the difference 
between a flow value and a capacity. When these are integers, the difference is 
also an integer. Starting with the zero flow, this implies that there is no first 
time when a noninteger flow appears. 

The algorithm thus produces a maximum flow with integer flow on each 
edge. At each internal node, we now match units of entering flow to units of 
exiting flow. This forms s, t-paths and perhaps cycles. Ifa cycle arises, then we 
decrease flow on its edges by 1 to eliminate it without changing the flow value. 
This leaves val( f) paths from s to t, each corresponding to a unit of flow. | 


=k 


The integrality theorem yields paths of unit flow. In applications, we build 
networks where these units of flow have meaning. 

The next two remarks show that the Max-flow Min-cut Theorem for net- 
works with integer capacities is almost the same statement as Menger’s Theo- 
rem for edge-disjoint paths in digraphs. 
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4.3.13. Remark. Menger from Max-flow Min-cut. When x, y are vertices in a 
digraph D, we can view D as a network with source x and sink y and capacity 
1 on every edge. Capacity 1 ensures that units of flow from x to y correspond 
te pairwise edge-disjoint x, y-paths in D. Thus a flow of value k yields a set of 
k such paths. 

Similarly, every source/sink partition S, T defines a set of edges whose dele- 
tion makes y unreachable from x: the set [S, T]. Since every capacity is 1, the 
size of this set is cap(S, T). 

The paths and the edge cut we have obtained might not be optimal, but by 
the Max-flow Min-cut Theorem we have 


Np (x, y) = maxval(f) = mincap(S, T) > «p(x, y). 


Since always x’(x, y) > A’(x, y), equality now holds. | 
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4.3.14. Remark. Max-flow Min-cut from Menger. To show that Menger’s The- 
orem implies the Max-flow Min-cut Theorem for rational capacities, we take 
an arbitrary network and transform it into a digraph where we apply Menger’s 
Theorem. By multiplying all capacities by the least common denominator, we 
may assume that the capacities are integers. 

Given a network N with integer capacities, we form a digraph D by split- 
ting eacn edge of capacity j into j edges with the same endpoints. For N, 
duality yields max val(f) < mincap(S, 7). This time we want to use Menger’s 
Theorem on D to obtain the reverse inquality, so in contrast to Remark 4.3.13 
our desired computation is 


max val(f) > A,(s,t) =Kkp(s, t) => mincap(S, T). 


A set of 4’(s, t) pairwise edge-disjoint s,t-paths in D collapses into a flow 
of value A’(s,t) in N, since the number of copies of each edge in D equals the 
capacity of the edge in N. Thus maxval(f) > A’(s, t). 

Now, let F be a set of «’'(s, t) edges disconnecting ¢ from s in D. Ife € F, 
then the minimality of F implies that D — (F — e) has an s, t-path P through 
e. If some other copy e’ of the edge e = uv is not in F, then P can be rerouted 
along e’ to obtain an s,t-path in D — F. Therefore, F contains all copies or no 
copies of each multiple edge in D. Hence x’(s, t) is the sum of the capacities on 
a set of edges that disconnects t from s in N. Letting S be the set of vertices 
reachable from s in D — F, we have cap(S,T) =k ‘(s,t). The minimum cut has 
at most this capacity, so mincap(S, Ths < x’(s,t), and we have proved all the 
needed inequalities. | 


gram.m 


For combinatorial applications, Méhger’s Theorem may yield simpler proofs 
than the Max-flow Min-cut Theorem (compare Theorem 4.2.25 with ?? ). Never- 
theless, our proof of Menger’s Theorem in Section 4.2 is awkward to implement 
algorithmically. For large-scale computations, network flow and the Ford- 
Fulkerson labeling algorithm are more appropriate. Indeed, most algorithms 
that compute connectivity in graphs and digraphs use network flow methods 
(Stoer—Wagner [1994] presents a different approach). 

We present other network models for combinatorial problems. For exam- 
ple, the other local versions of Menger’s Theorem can also be obtained directly. 


4.3.15. Remark. Other transformations. For each version of Menger’s Theo- 
rem, we encode the path problem using network flows with integer capacities. 

To obtain a network model for the problem of internally disjoint paths in a 
digraph D, we must prevent two units of flow from passing through a vertex. 
This can be done by replacing each vertex v with two vertices v_, v* that inherit 
the entering and exiting edges at v. By adding an edge of unit capacity from 
v~ to vt, we obtain the effect of limiting flow through v to one unit. By putting 
very large capacity (essentially infinite) on the edges that were in D, we ensure 
that a minimum cut will count only edges of the form v7 vt 

To obtain a network model for the problem of edge-disjoint paths in a graph 
G, we must permit flow to pass either way in an edge. This can be done by 
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replacing each edge uv with two directed edges uv and vu. When the network 
sends unit flow in both directions, in effect the edge is not being used at all. 

In each case, a flow in the network provides a set of paths, and a minimum 
cut leads to a separating set of vertices or edges. As in Remark 4.3.13, duality 
then gives us the desired equality in Menger’s Theorem. To model the problem 
of internally disjoint paths in a graph, we need both of these transformations. 
Exercises 5—7 request the details of these proofs. | 
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4.3.16. Application. Baseball Elimination Problem (Schwartz [1966]). At 
some time during the season, we may wonder whether team X can still win 
the championship. In other words, can winners be assigned for the remaining 
games so that no team ends with more victories than X? If so, then such an as- 
signment exists with X winning all its remaining games, reaching W wins. We 
want to know whether winners can be chosen for other games so that no team 
obtains more than W wins. To test this, we create a network where units of 
flow correspond to the remaining games. 

Let X,,..., X, be the other teams. Include nodes x1, ..., x, for the n teams, 
nodes y,,; for the (5) pairs of teams, anda source s and sink t. Put an edge from 
s to each team node and an edge from each pair node to ¢. Each pair node y, ; 
is entered by edges from x; andx;. = 

The capacities model the constraints. The capacity on edge y; ;t is a;,;, the 
number of remaining games between X; and X;. Given that X; has won w,; 
games already, the capacity on edge sx; is W — w; to keep X in contention. The 
capacity on edges x; y;,; and x; y;,; is oo (the number of games x; can win from x; 
is constrained by the capacity on y; ;t). 


By the integrality theorem, a maximum flow breaks into flow units. Each 
unit corresponds to one game; the first edge specifies the winner, and the last 
edge specifies the pair. The network has a fiow of value )/, ; i; if and only if 
all remaining games can be played with no team exceeding W wins; this is the 
condition for X remaining in contention. 

By the Max-flow Min-cut Theorem, there is a flow of value )° a; ; if and only 


if every cut has capacity at least }\ a; ;. Let S,T be a cut with finite capacity, 
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and let Z = {i: x; € T}. Since c(xy;,;) = oo, we cannot have x; € S and y;,; € 
T; thus y;,; € S whenever i or j is not in Z. To minimize capacity, we put 
yi,j € T whenever {i, j} C Z. Now cap(S, T) = iiez(W — wi) + oy. jgz%,j- The 
condition that every cut have capacity at least }° a;,; becomes 


yw —w;) > > i,j for all Z C [n]. 


ieZ {i,/}SZ 


Note that. this condition is obviously necessary; it states that we need enough 
leeway in the total wins among teams indexed by Z in order to accommodate 
winners for all the games among these teams. We have proved TONCAS. 4g 


Combinatorial applications of network flow usually involve showing that 
the desired configuration exists if and only if a related network has a large 
enough flow. As in Application 4.3.16, the Max-flow Min-cut Theorem then 
yields a necessary and sufficient condition for its existence. Other examples 
include most of Exercises 5— and also Exercise 13 and Theorems 4.3.17—4.3.18. 


SUPPLIES AND DEMANDS (optional) 


Next we consider a more general network model. We allow multiple sources 
and sinks, and also we associate with each source x; a supply o(x;) and with 
each sink y; a demand @(y;). To the capacity constraints for edges and conser- 
vation constraints for internal NOES: we pede transportation constraints for 
the sources and sinks. ei 


teleg 


ft(x) — f-() < o(x;) for each source x; 
f-~ Qj) — f* (yy) = 9(y,) for each sink y, 


The resulting configuration is a transportation network. With posi- 
tive values for the demands, the zero flow is not feasible. We seek a feasible 
flow satisfying these additional constraints. The “supply/demand” terminology 
suggests the constraints; we must satisfy the demands at the sinks without ex- 
ceeding the available supply at any source. This model is appropriate when a 
company has multiple distribution centers (sources) and retail outlets (sinks). 

Let X and Y denote the sets of sources and sinks, respectively. Let o(A) = 
ves O(v) and 0(B) = >°,., 9(v) denote the total supply. or demand at a set 
AC X or B CY. Fora set F of edges, let c(F) = )_,-,c(e). Given a set T of 
vertices, the net demand 0(YNT)—oa(XNMT) must be satisfied by flow from the 
remaining vertices. Hence it is necessary that c({T, 7]) be at least this large. 
Satisfying this for every set T is also sufficient for a feasible flow (TONCAS). 


4.3.17. Theorem. (Gale [1957]) In a transportation network N with sources X 
and sinks Y, a feasible flow exists if and only if 
c([S, T]) => AY NT) —a(XNT) 


for every partition of the vertices of N into sets S and T. 
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‘Proof: We have already observed the necessity of the condition. For sufficiency, 
construct a new network N’ by adding a supersource s and a supersink fr, with 
an edge of capacity o(x;) from s to each x; € X and an edge of capacity 90(y,) 
from each y; € Y tot. The transportation network N has a feasible flow if and 
only if N’ has a flow saturating each edge to t (a flow of value 0(Y)). 

By the Ford—Fulkerson Theorem, we know that N’ has a flow of value 0(Y) 
if and only if cap(S Us, T Ut) > 0(Y) for each partition S,T of V(N). The cut 
[S Us, T Ut] in N’ consists of [S, T] from N, plus edges from s to T and edges 
from S tot in N’. Hence 


cap(S Us, T Ut) =c(S,T)+0(TNX)+a(S NY). 
We now have cap(S Us, T Ut) > 0(Y) if and only if 
c(S,T)+a0(X NT) = 0(Y) —A(YNS)=a(YNT), - 


which is the condition assumed. | 


For specific instances, the construction of N’ is the key point, because we 
produce a feasible flow in N (when it exists) by running the Ford—Fulkerson al- 
gorithm on the network N’. When costs (per unit flow) are attached to the edges, 
we have the Min-cost Flow Problem,.which generalizes the Transportation 
Problem of Application 3.2.14. Solution algorithms for the Min-cost Flow Prob- 
lem appear in Ford—Fulkerson [1962] and in Ahuja~Magnanti—Orlin [1993]. 

We discuss several applications of-Gale’s condition. A pair of integer lists 
Pp = (p1,.--, Pm) and q = (q41,..., 4n) is bigraphic (Exercise 1.4.31) if there is 
a simple X, Y-bigraph such that the vertices of X have degrees pi,..., Pm and 
the vertices of Y have degrees q1,..., qn. Clearly > p; = >- q; is necessary, but 
this condition is not sufficient. To test whether (p, q) is bigraphic, we create a 
network in which units of flows will correspond to edges in the desired graph. 
The result is a bipartite analogue of the Erdés—Gallai condition for graphic 
sequences (Exercise 3.3.28). 


4.3.18. Theorem. (Gale [1957], Ryser [1957]) If p,q are lists of nonnegative 
integers with p, > --- > pm and qi > --: > qn, then (p,q) is bigraphic if 
and only if )>_., min{p;, k} > an qj forl <k <n. 

Proof: Necessity. Let G be a simple X, Y-bigraph realizing (p,q). Consider 

the edges incident to a set of k vertices in Y. Because G is simple, each x; € X 

is incident to at most k of these edges, and also x; is incident to at most p; of 

these edges. Hence }-;", min{p;, k} is an upper bound on the number of edges 
incident to any k vertices of Y, such as those with degrees qi, ..., qx. 
Sufficiency. Given (p,q), create a network N with an edge of capacity 1 
from x; to y; for each i, j, and let o(x;) = p; and 0(y,;) = q;. Unit capacity pre- 
vents multiple edges, and (p, q) is realizable if and only if N has a feasible flow. 
It suffices to show that the stated condition on p and q implies the condition 

of Theorem 4.3.17. For S C V(N), let 7(S) = {i: x; € S} and J(S) = {j: y; € S}. 

For a partition S, T of V(N), we now have o(X NT) = Dien) pi ANd A(Y NT) = 

jenn) qj, and we have c([S, T]) = |7(S)|- |J(T)|. 
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Letting k = |/(T)|, this last quantity becomes 
c([S, T])) = (S[k = riers) * > Viens) min{pi, k}. 


Also ienr) Di = ier(7) min{p;, k}, and Dien qj < ys q;- Combining these 
inequalities, the condition )-/., min{p;,k} > La qj implies c([S, T]) > a(y Nn 
T)—o(XNT). Since this holds for each partition S, 7, the network has a feasible 
flow, which yields the desired bipartite graph. | 


We can extend the maximum flow problem by imposing a nonnegative lower 
bound on the permitted flow in each edge. The capacity constraint remains as 
an upper bound, so we require /(e) < f(e) < u(e) for the flow f(e). We still im- 
pose conservation constraints on the internal nodes. If we have a feasible flow, 
then an easy modification of the Ford—Fulkerson labeling algorithm allows us 
to find a maximum (or minimum) feasible flow (Exercise 4). The difficulty is 
finding an initial feasible flow. First we present an application. 


4.3.19. Application. Matrix rounding (Bacharach [1966]). We may want to 
round the entries of a data matrix up or down to integers. We also want to 
present integers for the row sums and.column sums. The sum of each rounded 
row or column should be a rounding ofthe original sum. The resulting integer 
matrix, if it exists, is a consistent rounding. 

We can represent the consistent rounding problem as a feasible flow prob- 


lem. Establish vertices x),...,x, for the rows and vertices yi,...,¥, for the 
columns of the matrix. Add a source s ‘and a sink t. Add edges sx;, x; y;, yjt for 
all values of i and j. If the matrix has entries a; j with row-sums ry,...,7, and 
column-sums 5;,..., 5,, set 


l(sxi)=Lril laiy;)) = |aij| lojt) = le | 
u(sx;) = [ri] uQay;) = [aj] uQyjt) = [<i] 


We test for a feasible flow by transforming again to an ordinary maximum 
flow problem. With these two transformations, we can use network flow to test 
for the existence of a consistent rounding. | 


6.3 7.6 4.6 
(2 2.3 28 
5.5 4,5 3.5 
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4.3.20. Solution. Circulations and flows with lower bounds. In a maximum 
flow problem with upper and lower bounds on edge capacities, the zero flow is 
not feasible, so the Ford—Fulkerson labeling algorithm has no place to start. 
We must first obtain a feasible flow, after which an easy modification of the 
labeling algorithm applies (Exercise 4). 

The first step is to add an edge of infinite capacity from the sink to the 
source. The resulting network has a feasible flow with conservation at every 
node (called a circulation) if and only if the original network has a feasible 
flow. In a circulation problem, there is no source or sink. 

Next, we convert a feasible circulation problem C into a maximum flow 
problem N by introducing supplies or demands at the nodes and adding a source 
and sink to satisfy the supplies and demands. Given the flow constraints /(e) < 
f(e) < u(e), let c(e) = u(e) —1(e) for each edge e. For each vertex v, let 


I~ (v) — oeelV(C)—v.vl i(e), 
Mv) = Veeety.vicy—u /(); 
b(v) = I-(v)—-It(v). 


Since each /(uv) contributes to /+(u) and 1~(v), we have >> b(v) = 0. A feasible 
circulation f must satisfy the flow constraints at each edge and satisfy ft(@)- 
f~(v) = 0 at each node. Letting f’(e) = f(e) — l(e), we find that f is a feasible 
circulation in C if and only if f’ satisfies 0 < f’(e) < c(e) on each edge and 
f't(v) — f’-(v) = b(v) at each vertex. = 

This transforms the feasible circulation problem into a flow problem with 
supplies and demands. If b(v) > 0, then v supplies flow |b(v)| to the net- 
work; otherwise v demands |b(v)|. To restore conservation constraints, we add 
a source s with an edge of capacity b(v) to each v with b(v) > 0, and we add a 
sink t with an edge of capacity —b(v) from each v with b(v) < 0. This completes 
the construction of N. 

Let a be the total capacity on the edges leaving s; since }) b(v) = 0, the 
edges entering t also have total capacity a. Now C has a feasible circulation f 
if and only if N has a flow of value a (saturating all edges out of s or intot). # 


4.3.21. Corollary. A network D with conservation constraints at every node 
has a feasible circulation if and only if > 7.15. yl) s ecels.s} “(e) for every 
SCV(D). 


Proof: We can stop before the last step in the discussion of Solution 4.3.20 
and interpret our problem with supplies and demands in the model of Theorem 
4.3.17. Since }° b(v) = 0, the only way to satisfy all the demands is to use up 
all the supply. Hence there is a circulation if and only if the supply/demand 
problem with supplies o(v) = b(v) for {v € V(D) : b(v) = 0} and demands 
d(v) = —b(v) for {v € V(D) : b(v) < 0} has a solution. 

Theorem 4.3.17 characterizes when this problem has a solution. Translated 
back into the lower and upper bounds on flow in the original problem (Exer- 
cise 22), the criterion of Theorem 4.3.17 becomes eels Hi l(e) < . efs.s) 4(e) for 
every S C V(D). | 
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EXERCISES 


4.3.1. (—) In the network below, list all integer-valued feasible flows and select a flow of 
maximum value (this illustrates the advantage of duality over exhaustive search). Prove 
that this flow is a maximum flow by exhibiting a cut with the same value. Determine 
the number of source/sink cuts. (Comment: There is a nonzero flow with value 0.) 


4.3.2. (—) In the network below, find a maximum flow from s to t. Prove that your 
answer is optimal by using the dual problem, and explain why this proves optimality. 


raat 
PNA t 


4.3.3. (—) A kitchen sink draws water from two tanks according to the network of pipes 
with capacities per unit time shown below. Find the maximum flow. Prove that your 
answer is optimal by using the dual problem, and explain why this proves optimality. 


Sink 


4.3.4. Let N be a network with edge capacity and node conservation constraints plus 
lower bound constraints /(e) on the flow in edges, meaning that f(¢) > I(e) is required. 
If an initial feasible flow is given, how can the Ford—Fulkerson labeling algorithm be 
modified to search for a maximum feasible flow in this network? 


Section 4.3: Network Flow Problems 189 


4.3.5. (!) Use network flows to prove Menger’s Theorem for internally-disjoint paths in 
digraphs: «(x, y) = A(x, y) when xy is not an edge. (Hint: Use the first transformation 
suggested in Remark 4.3.15.) 


4.3.6. (!) Use network flows to prove Menger’s Theorem for edge-disjoint paths in graphs: 
k'(x, y) =A’(x, y). (Hint: Use the second transformation suggested in Remark 4.3.15.) 


4.3.7. (!) Use network flows to prove Menger’s Theorem for nonadjacent vertices in 
graphs: x(x, y) = A(x, y). (Hint: Use both transformations suggested in Remark 4.3.15.) 


4.3.8. Let G be a directed graph with x, y € V(G). Suppose that capacities are specified 
not on the edges of G, but rather on the vertices (other than x, y); for each vertex there is 
a fixed limit on the total flow through it. There is no restriction on flows in edges. Show 
how to use ordinary network flow theory to determine the maximum value of a feasible 
flow from x to y in the vertex-capacitated graph G. 


4.3.9. Use network flows to prove that a graph G is connected if and only if for every 
partition of V(G) into two nonempty sets 5,7, there is an edge with one endpoint in 
S and one endpoint in 7. (Comment: Chapter 1 contains an easy direct proof of the 
conclusion, so this is an example of “using a sledgehammer to squash a bug”.) 


4.3.10. (!) Use network flows to prove the Kénig—Egervary Theorem (a’(G) = B(G) ifG 
is bipartite). 


4.3.11. Show that the Augmenting Path Algorithm for bipartite graphs (Algorithm 
3.2.1) is a special case of the Ford-Fulkerson Labeling Algorithm. 


4.3.12. Let [S, S] and [T, T] be source/sink cuts in anetwork N. _ _ 

a) Prove that cap(S UT, SUT) + cap(S NT, SOT) < cap([S, S]) + cap(T, T). (Hint: 
Draw a picture and consider contributions from various types of edges.) 

b) Suppose that [S, S] and [7,7] are minimum cuts. Conclude from part (a) that 
[SUT, SUT] and [SN T,ST] are also minimum cuts. Conclude also that no edge 
between S — T and T — Shas positive capacity. 


4.3.13. (!).Several companies send representatives to a conference; the ith company 
sends m; representatives. The organizers of the conference conduct simultaneous net- 
working groups; the jth group can accommodate up to n; participants. The organizers 
want to schedule all the participants into groups, but the participants from the same 
company must be in different groups. The groups need not all be filled. 

a) Show how to use network flows to test whether the constraints can be satisfied. 

b) Let p be the number of companies, and let g be the number of groups, indexed 
so that m, >--- >m, and n; <--- < ng. Prove that there exists an assignment of par- 
ticipants to groups that satisfies all the constraints if and only if, for all 0 < k < p and 
0 </ <q, it holds that k(g —/) + 4 nj = 4 mj. 


4.3.14. In a large university with k academic departments, we must appoint an impor- 
tant committee. One professor will be chosen from each department. Some professors 
have joint appointments in two or more departments, but each must be the designated 
representative of at most one department. We must use equally many assistant profes- 
sors, associate professors, and full professors among the chosen representatives (assume 
that k is divisible by 3). How can the committee be found? (Hint: Build a network in 
which units of flow correspond to professors chosen for the committee and capacities en- 
force the various constraints. Explain how to use the network to test whether such a 
committee exists and find it if it does.) (Hall.[1956]) 
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4.3.15. Let G be a weighted graph. Let the value of a spanning tree be the minimum 
weight of its edges. Let the cap from a edge cut [S, S] be the maximum weight of its 
edges. Prove that the maximum value of a spanning tree of G equals the minimum cap 
of an edge cut in G. (Ahuja—Magnanti—Orlin [1993, p538]) 


4.3.16. (+) Let x be a vertex of maximum outdegree in a tournament 7. Prove that T 
has a spanning directed tree rooted at x such that every vertex has distance at most 2 
from x and every vertex other than x has outdegree at most 2. (Hint: Create a network to 
model the desired paths to the non-successors of x, and show that every cut has enough 
capacity. Comment: This strengthens Proposition 1.4.30 about kings in tournaments; 
no vertex need be an intermediate vertex for more than two others.) (Lu [1996]) 


4.3.17. (*—) Use the Gale-Ryser Theorem (Theorem 4.3.18) to determine whether 
there is a simple bipartite graph in which the vertices in one partite set have degrees 
(5, 4,4, 2, 1) and the vertices in the other partite set also have degrees (5, 4, 4, 2, 1). 


4.3.18. (+—) Given list r = (r,...,7,) and s = = (5,,...,5,), obtain necessary and suffi- 
cient conditions for the existence of a digraph D with vertices v;,..., v, such that each 
ordered pair occurs at most once as an edge and d*(v;) =r; and d~ Ge = 5; for alli. 


4.3.19. (*—) Find a consistent eae of the data in the matrix below. Is it unique? 
(Every entry must be 0 or 1.) 


bo “6 6 
5 65 .7 
6 65 .7 


4.3.20. (*) Prove that every two-by-two matrix can be consistently rounded. 


4.3.21. (*) Suppose that every entry in an n-by-n matrix is strictly between 1/n and 
1/(n — 1). Describe all consistent roundings. 


4.3.22. (*) Complete the details of proving Corollary 4.3.21, proving the necessary and 
sufficient condition for a circulation in a network with lower and upper bounds. 


4.3.23. («!) A (k + /)-regular graph G is (k, /)-orientable if it can be oriented so that 
each indegree is k or /. 

a) Prove that G is (k, /)-orientable if and only if there is a partition X, Y of V(G) 
such that for every S C V(G), 


(k-D(XNS|-1¥ AS) < |S, SI]. 


(Hint: Use Theorem 4.3.17.) 
b) Conclude that if G is (k,/)-orientable and k > /, then G'is also (k — 1,/ + 1)- 
orientable. (Bondy—Murty [1976, p210—211]) 
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Coloring of Graphs 


5.1. Vertex Coloring and Upper Bounds 


The committee-scheduling example (Example 1.1.11) used graph coloring 
to model avoidance of conflicts. Similarly, in a university we want to assign 
time slots for final examinations so that two courses with a common student 
have different slots. The number of slots needed is the chromatic number of 
the graph in which two courses are adjacent if they have a common student. 

Coloring the regions of a map with different colors on regions with common 
boundaries is another example; we return to it in Chapter 6. The map on the 
left below has five regions, and four colors suffice. The graph on the right mod- 
els the “common boundary” relation and the corresponding coloring. Labeling 
of vertices is our context for coloring problems. 


ae 
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DEFINITIONS AND EXAMPLES 


Graph coloring takes its name from the map-coloring application. We as- 
sign labels to vertices. When the numerical value of the labels is unimportant, 
we call them “colors” to indicate that they may be elements of any set. 


5.1.1. Definition. A k-coloring of a graph G is a labeling f: V(G) — S, where 
|S| = k (often we use S = [k]). The labels are colors; the vertices of one 
color form a color class. A k-coloring is proper if adjacent vertices have 
different labels. A graph is k-colorable if it has a proper k-coloring. The 
chromatic number x (G) is the least k such that G is k-colorable. 
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5.1.2. Remark. In a proper coloring, each color class is an independent set, so 
G is k-colorable if and only if V(G) is the union of k independent sets. Thus 
“k-colorable” and “k-partite” have the same meaning. (The usage of the two 
terms is slightly different. Often “k-partite” is a structural hypothesis, while 
“k-colorable” is the result of an optimization problem.) 

Graphs with loops are uncolorable; we cannot make the color of a vertex 
different from itself. Therefore, in this chapter all graphs are loopless. 
Also, multiple edges are irrelevant; extra copies don’t affect colorings. Thus 
we usually think in terms of simple graphs when discussing colorings, and 
we will name edges by their endpoints. Most of the statements made without 
restriction to simple graphs remain valid when multiple edges are allowed. & 


5.1.3. Example. Since a graph is 2-colorable if and only if it is bipartite, Cs 


and the Petersen graph have chromatic number at least 3. Since they are 3- 
colorable, as shown below, they have chromatic number exactly 3. | 


a 


5.1.4. Definition. A graph G is k-chromatic if x(G) = k. A proper k-coloring 
of a k-chromatic graph is an optimal coloring. If x(H) < x(G) =k for 
every proper subgraph H of G, then G is color-critical or k-critical. 


legram.me/math_books 


§.1.5. Example. k-critical graphs for small k. Properly coloring a graph needs 
at least two colors if and only if the graph has an edge. Thus K2 is the only 
2-critical graph (similarly, K, is the only 1-critical graph). Since 2-colorable is 
the same as bipartite, the characterization of bipartite graphs implies that the 
3-critical graphs are the odd cycles. 

We can test 2-colorability of a graph G by computing distances from a 
vertex x (in each component). Let X = {u € V(G): d(u,x) is even}, and let 
Y = {u € V(G): d(u, x) is odd}. The graph G is bipartite if and only if X, Y isa 
bipartition, meaning that G[X] and G[Y] are independent sets. 

No good characterization of 4-critical graphs or test for 3-colorability is 
known. Appendix B discusses the computational ramifications. a 


5.1.6. Definition. The clique number of a graph G, written w(G), is the 
maximum size of a set of pairwise adjacent vertices (clique) in G. 


We have used a(G) for the independence number of G; the usage of w(G) 
is analogous. The letters a and w are the first and last in the Greek alphabet. 
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This is consistent with viewing independent sets and cliques as the beginning 
and end of the “evolution” of a graph (see Section 8.5). 

5.1.7. Proposition. For every graph G, x(G) > w(G) and x(G) > mae 

Proof: The first bound holds because vertices of a clique require distinct colors. 
The second bound holds because each color class is an independent set and thus 
has at most a(G) vertices. rs 


Both bounds in Proposition 5.1.7 are tight when G is a complete graph. 


5.1.8. Example. x(G) may exceed w(G). For r > 2, let G = Co,,1 V K, (the join 
of Co,,1 and K,—see Definition 3.3.6). Since C2,,1 has no triangle, w(G) = s+2. 

Properly coloring the induced cycle requires at least three colors. The s- 
clique needs s colors. Since every vertex of the induced cycle is adjacent to 
every vertex of the clique, these s colors must differ from the first three, and 
x(G) > s+ 3. We conclude that x(G) > w(G). a 


Exercises 23-30 discuss the chromatic number for special families of 
graphs. We can also ask how it behaves under graph operations. For the disjoint 
union, x(G + H) = max{x(G), x(H)}. For the join, x(G V H) = x(G) + x(H). 
Next we introduce another combining operation. 


5.1.9. Definition. The cartesian product of G and H, written GOH, is the 
graph with vertex set V(G) x V(H) specified by putting (u, v) adjacent to 
(u’, v’) if and only if (1) u = wv’ and vv’ € E(A), or (2) v = v' and uw’ € E(G). 


5.1.10. Example. The cartesian product operation is symmetric; GOH = HuG. 
Below we show C30.C4. The hypercube is another familiar example: Q,; = 
Q,-10 Ka when k > 1. The m-by-n grid is the cartesian product P,, 0 Py. 

In general, GO H decomposes into copies of H for each vertex of G and 
copies of G for each vertex of H (Exercise 10). We use 0 instead of x to avoid 
confusion with other product operations, reserving x for the cartesian product 
of vertex sets. The symbol O, due to Rédl, evokes the identity KoO0Ke=C4. @ 


x (x, d) 


Z (z, d) 
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5.1.11. Proposition. (Vizing [1963], Aberth [1964]) x (GOH) =max{x(G),x(H)}. 


Proof: The cartesian product GOA contains copies of G and H as subgraphs, 
so x(GOH) => max{x(G), x(H)}. 

Let k = max{x(G), x(H)}. To prove the upper bound, we produce a proper 
k-coloring of GOH using optimal colorings of G and H. Let g be a proper x (G)- 
coloring of G, and let h be a proper x(H)-coloring of H. Define a coloring f of 
GOH by letting f(u, v) be the congruence class of g(u) + h(v) modulo k. Thus 
f assigns colors to V(GO A) from a set of size k. 

We claim that f properly colors GOH. If (u, v) and (u’, v’) are adjacent in 
GOH, then g(u) + h(v) and g(u’) +h(v’) agree in one summand and differ by 
between 1 and k in the other. Since the difference of the two sums is between 
1 and k, they lie in different congruence classes modulo k. | 


2 


The cartesian product allows us to compute chromatic numbers by comput- 
ing independence numbers, because a graph G is m-colorable if and only if the 
cartesian product Gd K,, has an independent set of size n(G) (Exercise 31). 


UPPER BOUNDS 


Most upper bounds on the chromatic number come from algorithms that 
produce colorings. For example, assigning distinct colors to the vertices yields 
x(G) < n(G). This bound is best possible, since x(K,) = n, but it holds with 
equality only for complete graphs. We can improve a “best-possible” bound by 
obtaining another bound that is always at least as good. For example, x(G) < 
n(G) uses nothing about the structure of G; we can do better by coloring the 
vertices in some order and always using the “least available” color. 


5.1.12. Algorithm. (Greedy coloring) 


The greedy coloring relative to a vertex ordering v1, ..., v, of V(G) is obtained 
by coloring vertices in the order v1, ..., v,, assigning to v; the smallest-indexed 
color not already used on its lower-indexed neighbors. | 


5.1.13. Proposition. x(G) < A(G) +1. 

Proof: In a vertex ordering, each vertex has at most A(G) earlier neighbors, 
so the greedy coloring cannot be forced to use more than A(G) + 1 colors. This 
proves constructively that x(G) < A(G) +1. | 
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The bound A(G) + 1 is the worst upper bound that greedy coloring could 
produce (although optimal for cliques and odd cycles). Choosing the vertex 
ordering carefully yields improvements. We can avoid the trouble caused by 
vertices of high degree by putting them at the beginning, where they won’t have 
many earlier neighbors (see Exercise 36 for a better ordering). 


5.1.14. Proposition. (Welsh—Powell (1967]) If a graph G has degree sequence 
d, >--->d,, then x(G) < 1+ max; min{d;,i — 1}. 


Proof: We apply greedy coloring to the vertices in nonincreasing order of de- 
gree. When we color the ith vertex v;, it has at most min{d;,i — 1} earlier 
neighbors, so at most this many colors appear on its earlier neighbors. Hence 
the color we assign to v; is at most 1+ min{d;,i — 1}. This holds for each vertex, 
so we maximize over i to obtain the upper bound on the maximum color used. @ 


The bound in Proposition 5.1.14 is always at most 1+ A(G), so this is 
always at least as good as Proposition 5.1.13. It gives the optimal upper bound 
in Example 5.1.8, while 1+ A(G) does not. 

In Proposition 5.1.14, we use greedy coloring with a well-chosen ordering. 
In fact, every graph G has some vertex ordering for which the greedy algorithm 
uses only x(G) colors (Exercise 33). Usually itis hard to find such an ordering. 

Our next example introduces a class of graphs where such an ordering i is 
easy to find. The ordering produces a coloring that achieves equality in the 
bound x(G) > w(G). 


am.me/ 


5.1.15. Example. Register allocation and interval graphs. A computer pro- 
gram stores the values of its variables in memory. For arithmetic computa- 
tions, the values must be entered in easily accessed locations called registers. 
Registers are expensive, so we want to use them efficiently. If two variables are 
never used simultaneously, then we can allocate them to the same register. For 
each variable, we compute the first and last time when it is used. A variable is 
active during the interval between these times. 

We define a graph whose vertices are the variables. Two vertices are ad- 
jacent if they are active at a common time. The number of registers needed is 
the chromatic number of this graph. The time when a variable is active is an 
interval, so we we obtain a special type of representation for the graph. 

An interval representation of a graph is a family of intervals assigned 
to the vertices so that vertices are adjacent if and only if the corresponding in- 
tervals intersect. A graph having such a representation is an interval graph. 

For the vertex ordering a, b,c,d,e, f,g,h of the interval graph below, 
greedy coloring assigns 1,2,1,3,2,1,2,3, respectively, which is optimal. 
Greedy colorings relative to orderings starting a,d,... use four colors. | 


é 
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5.1.16. Proposition. If G is an interval graph, then x(G) = w(G). 


Proof: Order the vertices according to the left endpoints of the intervals in an 
interval representation. Apply greedy coloring, and suppose that x receives 
k, the maximum color assigned. Since x does not receive a smaller color, the 
left endpoint a of its interval belongs also to intervals that already have colors 
1 through k — 1. These intervals all share the point a, so we have a k-clique 
consisting of x and neighbors of x with colors 1 through k — 1. Hence w(G) > 
k > x(G). Since x(G) > w(G) always, this coloring is optimal. a 


5.1.17.* Remark. The greedy coloring algerithm runs rapidly. It is “on-line” in 
the sense that it produces a proper coloring even if it sees only one new vertex 
at each step and must color it with no option to change earlier colors. For a 
random vertex ordering in a random graph (see Section 8.5), greedy coloring 
almost always uses only about twice as many colors as the minimum, although 
with a bad ordering it may use many colors on a tree (Exercise 34). | 


We began with greedy coloring to underscore the constructive aspect of 
upper bounds on chromatic number. Other bounds follow from the properties 
of k-critical graphs but don’t produce proper colorings: every k-chromatic graph 
has a k-critical subgraph, but we have.no good algorithm for finding one. We 
derive the next bound usiny critical subgraphs; it can also be proved using 
greedy coloring (Exercise 36). 


math_| 


5.1.18. Lemma. If H i is a k-critical graph, then 6(H) > k —1. 


Proof: Let x be a vertex of H. Because H i is k-critical, H — x is k — 1-colorable. 
If dy (x) < k — 1, then the k — 1 colors used on H — x do not all appear on N(x). 
We can assign x a color not used on N(x) to obtain a proper k — 1-coloring of H. 
This contradicts our hypothesis that x(H) =k. We conclude that dy(x) >k—-—1 
(for each x € V(A)). | 


5.1.19. Theorem. (Szekeres—Wilf [1968]) If G is a graph, then x(G) < 1+ 
mMaxycG O(A). 


Proof: Let k = x(G), and let H’ be a k-critical subgraph of G. Lemma 5.1.18 
yields x(G) —1= x(A’) — 1 < 8(A’*) < maxycg 5(A). | 


The next bound involves orientations (see also Exercises 43—45). 


5.1.20. Example. If G is bipartite, then the orientation of G that directs every 
edge from one partite set to the other has no path (in the directed sense) of 
length more than 1. The next theorem thus implies that x(G) < 2. 

Every orientation of an odd cycle must somewhere have two consecutive 
edges in the same direction. Thus each orientation has a path of length at least 
two, and the theorem confirms that an odd cycle is 3-chromatic. | 


5.1.21. Theorem. Gallai-Roy—Vitaver Theorem (Gallai [1968], Roy [1967], Vi- 
taver [1962]) If D is an orientation of G with longest path length /(D), then 
x(G) < 1+/(D). Furthermore, equality holds for some orientation of G. 
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Proof: Let D be an orientation of G. Let D’ be a maximal subdigraph of D 
that contains no cycle (in the example below, uv is the only edge of D not in D’). 
Note that D’ includes all vertices of G. Color V(G) by letting f(v) be 1 plus the 
length of the longest path in D’ that ends at v. 

Let P be a path in D’, and let u be the first vertex of P. Every path in D’ 
ending at u has no other vertex on P, since D’ is acyclic. Therefore, each path 
ending at u (including the longest such path) can be lengthened along P. This 
implies that f strictly increases along each path in D’. 

The coloring f uses colors 1 through 1+/(D’) on V(D’) (which is also V(G)). 
We claim that f is a proper coloring of G. For each uv € E(D), there is a path in 
D’ between its endpoints (since uv is an edge of D’ or its addition to D’ creates 
a cycle). This implies that f(u) 4 f(v), since f increases along paths of D’. 

To prove the second statement, we construct an orientation D* such that 
l(D*) < x(G) —1. Let f be an optimal coloring of G. For each edge uv in G, 
orient it from u to v in D* if and only if f(u) < f(v). Since f is a proper coloring, 
this defines an orientation. Since the labels used by f increase along each path 
in D*, and there are only x(G) labels in f, we have /(D*) < x(G) —1. a 
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BROOKS’ THEOREM 


The bound x(G) < 1+ A(G) holds with equality for complete graphs and 
odd cycles. By choosing the vertex ordering more carefully, we can show that 
these are essentially the only such graphs. This implies, for example, that 
the Petersen graph is 3-colorable, without finding an explicit coloring. To avoid 
unimportant complications, we phrase the statement only for connected graphs. 
It extends to all graphs because the chromatic number of a graph is the maxi- 
mum chromatic number of its components. Many proofs are known; we present 
a modification of the proof by Lovasz [1975]. 


5.1.22. Theorem. (Brooks [1941]) If G is a connected graph other than a com- 
plete graph or an odd cycle, then x(G) < A(G). 

Proof: Let G be a connected graph, and let k = A(G). We may assume that 

k > 3, since G is a complete graph when k < 1, and G is an odd cycle or is 

bipartite when k = 2, in which case the bound holds. 
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Our aim is to order the vertices so that each has at most k — 1 lower-indexed 
neighbors; greedy coloring for such an ordering yields the bound. 

When G is not k-regular, we can choose a vertex of degree less than k as 
v,. Since G is connected, we can grow a spanning tree of G from v,, assigning 
indices in decreasing order as we reach vertices. Each vertex other than v, in 
the resulting ordering v,..., v, has a higher-indexed neighbor along the path 
to v, in the tree. Hence each vertex has at most k — 1 lower-indexed neighbors, 
and the greedy coloring uses at most k colors. 


eg ees 


Uj 

In the remaining case, G is k-regular. Suppose first that G has a cut-vertex 
x, and let G’ be a subgraph consisting of a component of G — x together with its 
edges to x. The degree of x in G’ is less than k, so the method above provides 
a proper k-coloring of G’. By permuting the names of colors in the subgraphs 
resulting in this way from components of G — x, we can make the colorings 
agree on x to complete a proper k-coloring of G. 

We may thus assume that G is 2-connected. In every vertex ordering, the 
last vertex has k earlier neighbors. The greedy coloring idea may still work if 
we arrange that two neighbors of uv, get the same color. 

In particular, suppose that some vertex v, has neighbors v;, ve such that 
v1 # v2 and G — {vj, vg} is connected.-In this case, we index the vertices of a 
spanning tree of G — {v}, v2} using 3, .. , n such that labels increase along paths 
to the root v,. As before, each vertex before v, has at most k — 1 lower indexed 
neighbors. The greedy coloring also uses at most k — 1 colors on neighbors of 
Un, Since v; and vy receive the same color. 

Hence it suffices to show that every 2-connected k-regular graph with k > 3 
has such a triple vj, v2, v,. Choose a vertex x. If «(G — x) > 2, let v; be x and 
let vo be a vertex with distance 2 from x. Such a vertex vo exists because G is 
regular and is not a complete graph; let v, be a common neighbor of v, and ve. 

If «(G — x) = 1, let v,.= x. Since G has no cut-vertex, x has a neighbor in 
every leaf block of G — x . Neighbors v1, ve of x in two such blocks are nonadja- 
cent. Also, G — {x, v1, ve} is connected, since blocks have no cut-vertices. Since 
'k > 8, vertex x has another neighbor, and G — {v}, v2} is connected. | 


al v2 


0, =x 


§.1.23.* Remark. The bound x(G) < A(G) can be improyed when G has no 
large clique (Exercise 50). Brooks’ Theorem implies that the complete graphs 
and odd cycles are the only k — 1-regular k-critical graphs (Exercise 47). Gallai 
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[1963b] strengthened this by proving that in the subgraph of a k-critical graph 
induced by the vertices of degree k — 1, every block is a clique or an odd cycle. 

Brooks’ Theorem states that ¥(G) < A(G) whenever 3 < w(G) < A(G). 
Borodin and Kostochka [1977] conjectured that w(G) < A(G) implies x(G) < 
A(G) if A(G) > 9 (examples show that the condition A(G) > 9 is needed). Reed 
[1999] proved that this is true when A(G) > 1014. 

Reed [1998] also conjectured that the chromatic number is bounded by the 
average of the trivial upper and lower bounds; that is, x(G) < [ {O4+?@). = 


Because the idea of partitioning to satisfy constraints is so fundamental, 
there are many, many variations and generalizations of graph coloring. In 
Chapter 7 we consider coloring the edges of a graph. Sticking to vertices, we 
could allow color classes to induce subgraphs other than independent sets (“gen- 
eralized coloring”—Exercises 49-53). We could restrict the colors allowed to be 
used on each vertex (“list coloring”’—Section 8:4). We could ask questions in- 
volving numerical values of the colors (Exercise 54). We have only touched the 
tip of the iceberg on coloring problems. 


EXERCISES 


books 


5.1.1. (—) Compute the clique number, the independence number, and the chromatic 
number of the graph below. Does either bound in Proposition 5.1.7 prove optimality for 
some proper coloring? Is the graph color-critical? 


UT 
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5.1.2. (—) Prove that the chromatic number of a graph equals the maximum of the 
chromatic numbers of its components. 


5.1.3. (—) Let G,..., G, be the blocks of a graph G. Prove that x(G) = max; x (Gj). 
5.1.4. (—) Exhibit a graph G with a vertex v so that x (G—v) < x(G) and x(G—v) ‘< x(G). 


5.1.5. (—) Given graphs G and H, prove that x(G + H) = max{x(G), x(H)} and that 
x(GV H) = x(G)+ x(A). 


5.1.6. (—) Suppose that x(G) = w(G) + 1, as in Example 5.1.8. Let H, = G and H, = 
H,-1 Vv G fork > 1. Prove that x(H), = w(H), + k. 


5.1.7. (—) Construct a graph G that is neither a clique nor an odd cycle but has a vertex 
ordering relative to which greedy coloring uses A(G) + 1 colors. 


5.1.8. (—) Prove that maxycg46(H) < A(G) to explain why Theorem 5.1.19 is better 
than Proposition 5.1.13. Determine all graphs G such that maxycg 6(H) = A(G). 


5.1.9. (—) Draw the graph K,,30 Ps; and exhibit an optimal coloring of it. Draw Cs; 0C; 
and find a proper 3-coloring of it with color classes of sizes 9, 8, 8. 
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5.1.10. (—) Prove that GOH decomposes into n(G) copies of H and n(H) copies of G. 
5.1.11. (—) Prove that each graph below is isomorphic to C30C3. 


5.1.12. (—) Prove or disprove: Every k-chromatic graph G has a proper k-coloring in 
which some color class has a(G) vertices. 


5.1.13. (—) Prove or disprove: If G = F UH, then x(G) < x(F) + x(#). 
5.1.14. (—) Prove or disprove: For every graph G, x(G) < n(G) —a(G) + 1. 


5.1.15. (—) Prove or disprove: If G is a connected graph, then x(G) < 1+ a(G), where 
a(G) is the average of the vertex degrees in G. 


5.1.16. (—) Use Theorem 5.1.21 to prove that every tournament has a spanning path. 
(Rédei [1934]) 


5.1.17. (—) Use Lemma 5.1.18 to prove that x(G) < 4 for the graph G below. 


boc 


5.1.18. (—) Determine the number of colors needed to label V(K,,) such that each color 
class induces a subgraph with maximum degree at most k. 


5.1.19. (—) Find the error in the false argument below for Brooks’ Theorem (Theorem 
5.1.22). 

“We use induction on n(G); the statement holds when n(G) = 1. For the induction 
step, suppose that G is not a complete graph or an odd cycle. Since «(G) < 6(G), the 
graph G has a separating set S of size at most A(G). Let Gj,..., G, be the components 
of G — S, and let H; = G[V(G;) US]. By the induction hypothesis, each H; is A(G)- 
colorable. Permute the names of the colors used on these subgraphs to agree on S. This 
yields a proper A(G)-coloring of G.” 


e e e e e 
5.1.20. (!) Let G be a graph whose odd cycles are pairwise intersecting, meaning that 
every two odd cycles in G have a common vertex. Prove that x(G) < 5. 


5.1.21. Suppose that every edge of a graph G appears in at most one cycle. Prove that 
every block of G is an edge, a cycle, or an isolated vertex. Use this to prove that x(G) < 3. 


5.1.22. (!) Given a set of lines in the plane with no three meeting at a point, form a 
graph G whose vertices are the intersections of the lines, with two vertices adjacent 
if they appear consecutively on one of the lines. Prove that x(G) < 3. (Hint: This 
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can be solved by using the Szekeres—Wilf Theorem or by using greedy colori: + with an 
appropriate vertex ordering. Comment: The conclusion may fail when three lines are 
allowed to share a point.) (H. Sachs) 


Sex - & 


5.1.23. (!) Place n points on a circle, where n > k(k+1). Let G,,, be the 2k-regular graph 
obtained by joining each point to the k nearest points in each direction on the circle. For 
example, G,,1 = Cn, and G72, appears below. Prove that x(G,4) =k +1ifk +1 divides 
nand x(Grx) =k +2 ifk + 1 does not divide n. Prove that the lower bound on n cannot 
be weakened, by proving that x (Giusy-14) > K+2ifk > 2. 


5.1.24. (+) Let G be any 20-regular graph with 360 vertices formed in the following way. 
The vertices are evenly-spaced around a circle. Vertices separated by 1 or 2 degrees are 
nonadjacent. Vertices separated by 3, 4, 5-or 6 degrees are adjacent. No information 
is given about other adjacencies (except that G is 20-regular). Prove that x(G) < 19. 
(Hint: Color successive vertices in order around the circle.) (Pritikin) 


5.1.25. (+) Let G be the unit-distance graph in the plane; V(G) = R’, and two points 
are adjacent if their Euclidean distance is 1 (this is an infinite graph). Prove that 4 < 
x(G) < 7. (Hint: For the upper bound, present an explicit coloring by regions, paying 
attention to the boundaries.) (Hadwiger [1945, 1961], Moser—Moser [1961]) 


5.1.26. Given finite sets S;,..., Sn, let U = S; x --- x S,. Define a graph G with vertex 
set U by putting u < v ifand only ifu and v differ in every coordinate. Determine x(G). 


5.1.27. Let H be the complement of the graph in Exercise 5.1.26. Determine x (#2). 


5.1.28. Consider a traffic signal controlled by two switches, each of which can be set in 
n positions. For each setting of the switches, the traffic signal shows one of its n possible 
colors. Whenever the setting of both switches changes, the color changes. Prove that 
the color shown is determined by the position of one of the switches. Interpret this in 
terms of the chromatic number of some graph. (Greenwell—Lovasz [1974]) 


5.1.29. For the graph G below, compute x(G) and find a x (G)-critical subgraph. 
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5.1.30. (+) Let 5 = (3) denote the collection of 2-sets of the n-element set [n]. Define 
the graph G, by V(G,) = S and E(G,) = {(ij, jk): 1 < i < j < k <n} (disjoint pairs, 
for example, are nonadjacent). Prove that x(G,) = [lgn]. (Hint: Prove that G,, is r- 
colorable if and only if [r] has at least n distinct subsets. Comment: G,, is called the 
shift graph of K,,.) (attributed to A. Hajnal) 


5.1.31. (!) Prove that a graph G is m-colorable if and only ifa(GOK,,) > n(G). (Berge 
[1973, p379-80]) 


5.1.32. (!) Prove that a graph G is 2*-colorable if and only if G is the union of k bipartite 
graphs. (Hint: This generalizes Theorem 1.2.23.) 


5.1.33. (!) Prove that every graph G has a vertex ordering relative to which greedy 
coloring uses x (G) colors. 


5.1.34. (!) For all k ¢ N, construct a tree 7, with maximum degree k and an ordering o 
of V(7;,) such that greedy coloring relative to the ordering o uses k + 1 colors. (Hint: Use 
induction and construct the tree and ordering simultaneously. Comment: This result 
shows that the performance ratio of greedy coloring to optimal coloring can be as bad as 
(A(G) + 1)/2.) (Bean [1976]) 


5.1.35. Let G be a graph having no induced subgraph isomorphic to P,. Prove that 
for every vertex ordering, greedy coloring produces an optimal coloring of G. (Hint: 
Suppose that the algorithm uses k colors for the ordering v),...,v,, and let i be the 
smallest integer such that G has a clique consisting of vertices assigned colors i through 
k in this coloring. Prove that i = 1. Comment: P,-free graphs are also called cographs.) 


5.1.36. Given a vertex ordering o = v,.. =U, of a graph G, let G; = Gl{v,..., v;}] and 
f(o) = 1+ max; dg,(v;). Greedy coloring relative to o yields x(G) < f(c). Define o* 
by letting v, be a minimum degree vertex ‘of G and letting v; for i < n be a minimum 
degree vertex of G —{v;41,..., Un}. Show that f(o*) = 1+ maxycc 6(H), and thus that o* 
minimizes f(o). (Halin [1967], Matula [1968], Finck—Sachs [1969], Lick—White [1970]) 


5.1.37. Prove that V(G) can be partitioned into 1 + maxycg 5(f)/r classes such that 
every subgraph whose vertices lie in a single class has a vertex ot degree less than r. 
(Hint: Consider ordering o* of Exercise 5.1.36. Comment: This generalizes Theorem 
5.1.19. See also Chartrand—Kronk [1969] when r = 2.) 


5.1.38. (!) Prove that x(G) = w(G) when G is bipartite. (Hint: Phrase the claim in 
terms of G and apply results on bipartite graphs.) 


5.1.39. (!) Prove that every k-chromatic graph has at least (5) edges. Use this to prove 


that if G is the union of m complete graphs of order m, then x(G) < 1+m/m-—1. 
(Comment: This bound is near tight, but the Erdés—Faber—Lovasz Conjecture (see Erdés 
[1981]) asserts that x(G) = m when the complete graphs are pairwise edge-disjoint.) 


5.1.40. Prove that x(G) - x(G) > n(G), use this to prove that x(G) + x(G) > 2./n(G), 
and provide a construction achieving these bounds whenever ./n(G) is an integer. 
(Nordhaus—Gaddum [1956], Finck [1968]) 


5.1.41. (!) Prove that x (G)+ x (G) < n(G)+1. (Hint: Use induction on n(G).) (Nordhaus— 
Gaddum [1956]) 


5.1.42. (!) Looseness of x(G) = n(G)/a(G). Let G be an n-vertex graph, and let c = 
(n + 1)/a(G). Use Exercise 5.1.41 to prove that x(G) - x(G) < (n+ 1)2/4, and use 
this to prove that x(G) < c(n + 1)/4. For each odd n, construct a graph such that 
x(G) =c(n+ 1)/4. (Nordhaus—Gaddum [1956], Finck [1968]) 
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§.1.43. (!) Paths and chromatic number in digraphs. 

a) Let G = FUH. Prove that x(G) < x(F)x (#1). 

b) Consider an orientation D of G and a function f: V(G) — R. Use part (a) and 
Theorem 5.1.21 to prove that if x(G) > rs, then D has a path up > -:- > u, with 
f(uo) = -+: < f(€u,) or a path up > --- > v, with f(vo) > --- > fus). 

c) Use part (b) to prove that every sequence of rs + 1 distinct real numbers has an 
increasing subsequence of size r + 1 or a decreasing subsequence of size s + 1. (Erdés— 
Szekeres [1935]) 


5.1.44. (!) Minty’s Theorem (Minty [1962]). An acyclic orientation of a loopless graph 
is an orientation having no cycle. For each acyclic orientation D of G, let r(D) = 
maxc [a/b], where C is a cycle in G and a, b count the edges of C that are forward in 
D or backward in D, respectively. Fix a vertex x « V(G), and let W be a walk in G be- 
ginning at x. Let g(W) =a — b-r(D), where a is the number of steps along W that are 
forward edges in D and b is the number that are backward in D. For each y € V(G), let 
g(y) be the maximum of g(W) such that W is an x, y-walk (assume that G is connected). 

a) Prove that g(y) is finite and thus well-defined, and use g(y) to obtain a proper 
1+r(D)-coloring of G. Thus G is 1 + r(D)-colorable. 

b) Prove that x(G) = minpcp, where D is the set of acyclic orientations of G. 


5.1.45. (+) Use Minty’s Theorem (Exercise 5.1.44) to prove Theorem 5.1.21. (Hint: 
Prove that /(D) is maximized by some acyclic orientation of G.) 


5.1.46. (+) Prove that the 4-regular triangle-free graphs below are 4-chromatic. (Hint: 
Consider the maximum independent sets.“Comment: Chvatal [1970] showed that the 
graph on the left is the smallest a 4-regular 4-chromatic graph.) 
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5.1.47. (!) Prove that Brooks’ Theorem is equivalent to the following statement: every 
k — 1-regular k-critical graph is a complete graph or an odd cycle. 


5.1.48. Let G be a simple graph with n vertices and m edges and maximum degree 
at most 3. Suppose that no component of G is a complete graph on 4 vertices. Prove 
that G contains a bipartite subgraph with at least m — n/3 edges. (Hint: Apply Brooks’ 
Theorem, and then show how to delete a few edges to change a proper 3-coloring of G 
into a proper 2-coloring of a large subgraph of G.) 


5.1.49. (—) Prove that the Petersen graph can be 2-colored so that the subgraph induced 
by each color class consists of isolated edges and vertices. 


5.1.50. (!) Improvement of Brooks’ Theorem. 

a) Given a graph G, let ki, ..., k, be nonnegative integers with Ly k, > A(G)—t+1. 
Prove that V(G) can be partitioned into sets V,,..., V, so that for each i, the subgraph 
G; induced by V; has maximum degree at most k;.. (Hint: Prove that the partition 
minimizing )° e(G;)/k; has the desired property.) (Lovasz [1966]) 
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b) For 4 <r < A(G) +1, use part (a) to prove that x(G) < [ (ae) + 1)| when 


G has no r-clique. (Borodin—Kostochka [1977], Catlin [1978], Lawrence [1978]) 

5.1.51. (!) Let G be an k-colorable graph, and let P be a set of vertices in G such that 
d(x, y) => 4 whenever x, y € P. Prove that every coloring of P with colors from [k + 1] 
extends to a proper k + 1 coloring of G. (Albertson—Moore [1999]) 


5.1.52. Prove that every graph G can be [(A(G) + 1)/j|-colored so that each color class 
induces a subgraph having no j-edge-connected subgraph. For j > 1, prove that no 
smaller number of classes suffices when G is a j-regular j-edge-connected graph or 
is a complete graph with order congruent to 1 modulo j;. (Comment: For j = 1, the 
restriction reduces to ordinary proper coloring.) (Matula [1973]) 


5.1.53. (+) Let G,,, be the 2k-regular graph of Exercise 5.1.23. Fork < 4, determine the 
values of n such that G,,, can be 2-colored so that each color class induces a subgraph 
with maximum degree at most k. (Weaver—West [1994]) 


5.1.54. Let f be a proper coloring of a graph G in which the colors are natural numbers. 
The color sum is ) > -y (cg) f(v). Minimizing the color sum may require using more than 
x(G) colors. In the tree below, for example, the best proper 2-coloring has color sum 
12, while there is a proper 3-coloring with color sum 11. Construct a sequence of trees 
in which the kth tree 7, use k colors in a proper coloring that minimizes the color sum. 
(Kubicka—Schwenk [1989]) 
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5.1.55. (+) Chromatic number is bounded by one plus longest odd cycle length. 

a) Let G be a 2-connected nonbipartite graph containing an even cycle C. Prove 
that there exist vertices x, y on C and an-x, y-path P internally disjoint from C such 
that dc(x, y) # dp(x, y) mod 2. 

b) Let G be a simple graph with no odd cycle having length at least 2k + 1. Prove 
that if 5(G) > 2k, then G has a cycle of length at least 4k. (Hint: Consider the neighbors 
of an endpoint of a maximal path.) 

c) Let G be a 2-connected nonbipartite graph with no odd cycle longer than 2k — 1. 
Prove that x(G) < 2k. (Erdés—Hajnal [1966]) 
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5.2. Structure of k-chromatic Graphs 


We have observed that x(H) > w(#) for all H. When equality holds in this 
bound for G and all its induced subgraphs (as for interval graphs), we say that 
G is perfect; we discuss such graphs in Sections 5.3 and 8.1. Our concern with 
the bound x (G) > (G) in this section is how bad it can be. Almost always x(G) 
is much larger than w(G), in a sense discussed precisely in Section 8.5. (The 
average values of w(G), a(G), and x(G) over all graphs with vertex set [n] are 
very close to 2lgn, 2lgn, and n/(2lgn), respectively. Hence w(G) is generally 
a bad lower bound on x(G), and n/a(G) is generally a good lower bound.) 
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GRAPHS WITH LARGE CHROMATIC NUMBER 


The bound x(G) > w(G) can be tight, but it can also be very loose. There 
have been many constructions of graphs without triangles that have arbitrarily 
large chromatic number. We present one such construction here; others appear 
in Exercises 12-13. 


5.2.1. Definition. From a simple graph G, Mycielski’s construction pro- 
duces a simple graph G’ containing G. Beginning with G having vertex 
set {v1,..., v,}, add vertices U = {u,,...,u,} and one more vertex w. Add 
edges to make u; adjacent to all of Ng(v;), and finally let N(w) = U. 
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5.2.2. Example. From the 2-chromatic graph Ko, one iteration of Mycielski’s 
construction yields the 3-chromatic graph Cs, as shown above. Below we apply 
the construction to C5, producing the 4-chromatic Grétzsch graph. | 


5.2.3. Theorem. (Mycielski [1955]) From a k-chromatic triangle-free graph G, 
Mycielski’s construction produces a k + 1-chromatic triangle-free graph G’. 


Proof: Let V(G) = {v1,...,v,}, and let G’ be the graph produced from it by 
Mycielski’s construction. Let u,,...,u, be the copies of v1,..., v,, with w the 
additional vertex. Let U = {u,...,u,}. 

By construction, U is an independent set in G’. Hence the other vertices of 
any triangle containing u; belong to V(G) and are neighbors of v;. This would 
complete a triangle in G, which can’t exist. We conclude that G’ is triangle-free. 

A proper k-coloring f of G extends to a proper k + 1-coloring of G’ by setting 
f(uj) = f(v;) and f(w) = k + 1; hence x(G’) < x(G) + 1. We prove equality by 
showing that x(G) < x(G’). To prove this we consider any proper coloring of 
G’ and obtain from it a proper coloring of G using fewer colors. 

Let g be a proper k-coloring of G’ By changing the names of colors, we may 
assume that g(w) = k. This restricts g to {1,...,k — 1} on U. On V(G), it may 
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use all k colors. Let A be the set of vertices in G on which g uses color k; we 
change the colors used on A to obtain a proper k — 1-coloring of G. 

For each v; € A, we change the color of v; to g(u;). Because all vertices 
of A have color k under g, no two edges of A are adjacent. Thus we need only 
check edges of the form v;v’ with v; € A and v’ € V(G) — A. If v' © v;, then by 
construction also v’ < u;, which yields g(v’) # g(u;). Since we change the color 
on v; to g(u;), our change does not violate the edge v;v'. We have shown that 
the modified coloring of V(G) is a proper k — 1-coloring of G. | 


= 


If G is color-critical, then the graph G’ resulting from Mycielski’s construc- 
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tion is also color-critical (Exercise 9). 
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5.2.4.* Remark, Starting with G2 = Ko, iterating Mycielski’s construction pro- 
duces a sequence G2, G3, G4,... of graphs. The first three are Kz, Cs, and the 
Grotzsch graph. These are the smallest triangle-free 2-chromatic, 3-chromatic, 
and 4-chromatic graphs. The graphs then grow rapidly: n(G,) = 2n(G,_1) + 1. 
With n(G2) = 2, this yields n(G,) = 8 - 2*-? — 1 (exponential growth). 

Let f(k) be the minimum number of vertices in a triangle-free k-chromatic 
graph. Using probabilistic (non-constructive) methods, Erdés [1959] proved 
that f(k) < ck?+*, where is any positive constant and c depends on € but not on 
k. Using Ramsey numbers (Section 8.3), it is now known (non-constructively) 
that there are constants c1, co such that c)k? logk < f(k) < cok” logk. Exercise 
15 develops a quadratic lower bound. 

Blanche Descartes! [1947, 1954] constructed color-critical graphs with 
girth 6 (Exercise 13). Using probabilistic methods, Erdés [1959] proved that 
graphs exist with chromatic number at least k and girth at least g (Theorem 
8.5.11). Later, explicit constructions were found (Lovasz [1968a], NeSet¥ril—Rodl 
[1979], Lubotzsky—Phillips-Sarnak [1988], Kriz [1989)]). - 

By all these constructions, forbidding K, from G does not place a bound on 
x(G). Gyarfas [1975] and Sumner [1981]) conjectured that forbidding a fixed 
clique and a fixed forest as an induced subgraph does bound the chromatic 
number. Exercise 11 proves this when the forest is 2K2. (See also Kierstead— 
Penrice [1990, 1994], Kierstead [1992, 1997], Kierstead—Rédl [1996]) | 


This pseudonym was used by W.T. Tutte and also by three others. 
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EXTREMAL PROBLEMS AND TURAN’S THEOREM 


Perhaps extremal questions can shed some light on the structure of k- 
chromatic graphs. For example, which are the smallest and largest k-chromatic 
graphs with n vertices? 


5.2.5. Proposition. Every k-chromatic graph with n vertices has at least (5) 
edges. Equality holds for a complete graph plus isolated vertices. 


Proof: An optimal coloring of a graph has an edge with endpoints of colors i 
and j for each pair i, j of colors. Otherwise, colors i and j could be combined 
into a single color class and use fewer colors. Since there are (5) distinct pairs 
of colors, there must be 6) distinct edges. | 

Exercise 6 asks for the minimum size among connected k-chromatic graphs 
with n vertices. 

The maximization problem is more interesting (of course, it makes sense 
only when restricted to simple graphs). Given a proper k-coloring, we can con- 
tinue to add edges without increasing the chromatic number as long as two 
vertices in different color classes are nonadjacent. Thus we may restrict our 
attention to graphs without such pairs: 


§.2.6. Definition. A complete multipartite graph is a simple graph G whose 
vertices can be partitioned into sets so that u < v if and only if u and v be- 
long to different sets of the partition. Equivalently, every component of G is 
a complete graph. When k > 2, we write Kn,....n, for the complete k-partite 
graph with partite sets of sizes n1,;...,n, and complement K,, +--:+ Kn,- 


We use this notation only for k > 1, since K, denotes a complete graph. A 
complete k-partite graph is k-chromatic; the partite sets are the color classes 
in the only proper k-coloring. Also, since a vertex in a partite set of size f 
has degree n(G) — t, the edges can be counted using the degree-sum formula 
(Exercise 18). Which distribution of vertices to partite sets maximizes e(G)? 


5.2.7. Example. The Turdén graph. The Turan graph T,,, is the complete r- 
partite graph with n vertices whose partite sets differ in size by at most 1. By 
the pigeonhole principle (see Appendix A), some partite set has size at least 
[n/r] and some has size at most [n/r]|. Therefore, differing by at most 1 means 
that they all have size |n/r| or [n/r]. 

Let a = |n/r|. After putting a vertices in each partite set, b = n —ra 
remain, so T,,, has b partite sets of size a + 1 and r — b partite sets of size a. 
Thus the defining condition on T,,, specifies a single isomorphism class. | 


5.2.8. Lemma. Among simple r-partite (that is, r-colorable) graphs with n 
vertices, the Turan graph is the unique graph with the most edges. 

Proof: As noted before Definition 5.2.6, we need only consider complete r- 

partite graphs. Given a complete r-partite graph with partite sets differing by 
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more than 1 in size, we move a vertex v from the largest class (size i) to the 
smallest class (size j). The edges not involving v are the same as before, but 
v gains i — 1 neighbors in its old class and loses j neighbors in its new class. 
Since i—1 > j, the number of edges increases. Hence we maximize the number 
of edges only by equalizing the sizes as in T,,,,. | 


We used the idea of this local alteration previously in Theorem 1.3.19 and 
in Theorem 1.3.23; we are finding the largest r-partite subgraph of K,. 

What happens if we have more edges and thus force chromatic number at 
least r + 1? We have seen that there are graphs with chromatic number r + 1 
that have no triangles. Nevertheless, if we go beyond the maximum number of 
edges in an r-colorable graph with n vertices, then we are forced not only to use 
r +1 colors but also to have K,,; as a subgraph. 

This famous result of Turan generalizes Theorem 1.3.23 and is viewed as 
the origin of extremal graph theory. 


5.2.9. Theorem. (Turan [1941]) Among the n-vertex simple graphs with no 
r + 1-clique, T,,, has the maximum number of edges. 


Proof: The Turan graph T,,,, like every r-colorable graph, has no r + 1-clique, 
since each partite set contributes at most one vertex to each clique. If we can 
prove that the maximum is achieved by an r-partite graph, then Lemma 5.2.8 
implies that the maximum is achieved by T,,,. Thus it suffices to prove that if 
G has no r + 1-clique, then there is an r-partite graph H with the same vertex 
set as G and at least as many edges. = 

We prove this by induction on r. When r =1, Gand H have no edges. For 
the induction step, consider r > 1. Let G be an n-vertex graph with no r + 1- 
clique, and let x € V(G) be a vertex of degree k = A(G). Let G’ be the subgraph 
of G induced by the neighbors of x. Since x is adjacent to every vertex in G’ 
and G has nor + 1-clique, the graph G’ has no r-clique. We can thus apply the 
induction hypothesis to G’; this yields an r — 1-partite graph H’ with vertex set 
N(x) such that e(H’) > e(G’). 

Let H be the graph formed from H’ by joining all of N(x) to all of S = 
V(G) — N(x). Since S is an independent set, H is r-partite. We claim that 
e(H) => e(G). By construction, e(H) = e(H’) + k(n — k). We also have e(G) < 
e(G’) + >).<5 dg (v), since the sum counts each edge of G once for each endpoint 
it has outside V(G’). Since A(G) = k, we have dg(v) < k for each v € S, and 
|S] =n—k, so )o,¢5dg(v) < k(n —k). As desired, we have 


e(G) <e(G) + (n—ADk < e(A’) + k(n —k) =e(A) | 


x x 
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In fact, the Turan graph is the unique extremal graph (Exercise 21). Exer- 
cises 16—24 pertain to Turaén’s Theorem, including alternative proofs, the value 
of e(T,,,), and applications. The argument used in Theorem 1.3.23 was simply 
one instance of the induction step in Theorem 5.2.9. 

Turan’s theorem applies to extremal problems when some condition forbids 
cliques of a given order; we describe a geometric application from Bondy—Murty 
[1976, p113—115]. 


5.2.10.* Example. Distant pairs of points. In a circular city of diameter 1, we 
might want to locate n police cars to maximize the number of pairs that are 
far apart, say separated by distance more than d = 1/2. If six cars occupy 
equally spaced points on the circle, then the only pairs not more than d apart 
are the consecutive pairs around the outside: there are nine good pairs. 
Instead, putting two cars each near the vertices of an equilateral triangle 
with side-length /3/2 yields three bad pairs and twelve good pairs. (This may 
not be the socially best criterion!) In general, with [n/3] or |n/3] cars near 
each vertex of this triangle, the good pairs correspond to edges of the tripartite 
Turan graph. We show next that this construction is best. a 


§.2.11.* Application. In a set of n points in the plane with no pair more than 
distance 1 apart, the maximum number of pairs separated by distance more 
than 1/2 is |n?/3|. 


Proof: Draw a graph G on these points by making points adjacent when the dis- 
tance between them exceeds 1/./2. By Turan’s Theorem and the construction 
in Example 5.2.10, it suffices to show that G has no K4. 

Among any four points, some three form an angle of at least 90°: if the four 
form a convex quadrilateral, then the interior angles sum to 360°, and if one 
point is inside the triangle formed by the others, then with them it forms three 
angles summing to 360°. 

Suppose that G has a 4-vertex clique with points w, x, y,z, where <xyz > 
90°. Since the lengths of xy and yz exceed 1//2, xz is longer than the hy- 
potenuse of a right triangle with legs of length 1//2. Hence the distance 
between x and z exceeds 1, which contradicts the hypothesis. | 
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Even without the full structural statement of Turdn’s Theorem, one can 
prove directly a rough bound on the number of edges in an n-vertex graph with 
no K,,; (Exercise 16). Turning this around yields a sharp lower bound on the 
chromatic number of a graph in terms of the number of vertices and number of 
edges (Exercise 17). 


COLOR-CRITICAL GRAPHS 


The Turan graph solves a problem that is somehow opposite to under- 
standing what forces chromatic number k. It considers the maximal graphs 
that avoid needing k colors instead of the minimal graphs that do need k colors. 

Every k-chromatic graph has a k-critical subgraph, since we can continue 
discarding edges and isolated vertices without reducing the chromatic number 
until we reach a point where every such deletion reduces the chromatic number. 
Thus knowing the k-critical graphs could help us test for k — 1-colorability. We 
begin with elementary properties of k-critical graphs. 


5.2.12. Remark. A graph G with no isolated vertices is color-critical if and only 
if x(G — e) < x(G) for every e € E(G). Hence when we prove that a connected 
graph is color-critical, we need only compare it with subgraphs obtained by 
deleting a single edge. | 
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§.2.13. Proposition. Let G be a baritecal graph. 
a) For v € V(G), there is a proper k-coloring of G in which the color on v 
appears nowhere else, and the other k — 1 colors appear on N(v). 
b) For e € E(G), every proper k — 1-coloring of G — e gives the same color 
to the two endpoints of e. 


Proof: (a) Given a proper k — 1-coloring f of G — v, adding color k on v alone 
completes a proper k-coloring of G. The other colors must all appear on N(v), 
since otherwise assigning a missing color to v would complete a proper k — 1- 
coloring of G. 

(b) If some proper k — 1-coloring of G —e gave distinct colors to the endpoints 
of e, then adding e would yield a proper k — 1-coloring of G. | 


For any graph G, Proposition 5.2.13a holds for every v € V(G) such that 
x(G — v) < x(G) =k, and Proposition 5.2.13b holds for every e € E(G) such 
that x(G — e) < x(G) =k. 


5.2.14. Example. The graph C; v K, of Example 5.1.8 is color-critical. In gen- 
eral, the join of two color-critical graphs is always color-critical. This is easy 
to prove using Remark 5.2.12 and Proposition 5.2.13 by considering cases for 
the deletion of an edge; the deleted edge e may belong to G or H or have an 
endpoint in each (Exercise 3). a 


We proved in Lemma 5.1.18 that 6(G) => k — 1 when G is a k-critical graph. 
We can strengthen this to x’(G) > k — 1 by using the Kénig—Egervary Theorem. 
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5.2.15. Lemma. (Kainen) Let G be a graph with x(G) > k, and let X,Y bea 
partition of V(G). If G[X] and G[Y] are k-colorable, then the edge cut [X, Y] 
has at least k edges. 


Proof: Let X,,..., X, and Y;,..., ¥, be the partitions of X and Y formed by the 
color classes in proper k-colorings of G[X] and G[Y]. If there is no edge between 
X; and Y;, then X; UY; is an independent set in G. We show that if |[X, Y]| < k, 
then we can combine color classes from G[X] and G[Y] in pairs to form a proper 
k-coloring of G. 

Form a bipartite graph H with vertices X;,..., X, and Y;,..., Y,, putting 
X,Y; € E(H) if in G there is no edge between the set X; and the set Y;. If 
|[X, Y]| < k, then H has more than k(k — 1) edges. Since m vertices can cover at 
most km edges in a subgraph of K,,, E(H) cannot be covered by k — 1 vertices. 
By the Konig—Egervary Theorem, H therefore has a perfect matching M. 

In G, we give color i to all of X; and all of the set Y,; to which it is matched 
by M. Since there are no edges joining X; and Y,, doing this for all i produces a 
proper k-coloring of G, which contradicts the hypothesis that x(G) > k. Hence 


we conclude that |[X, Y]| > k. a 
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5.2.16. Theorem. (Dirac [1953]) Every k-critical graph is k — 1-edge-connected. 
Proof: Let G be a k-critical graph, and let [X, Y] be a minimum edge cut. Since 


G is k-critical, G[X] and G[Y] are k — 1-colorable. Applied with k — 1 as the 
parameter, Lemma 5.2.15 then states that |[X, Y]] >k —1. a 


Although a k-critical graph must be k — 1-edge-connected, it need not be 
k — 1-connected; Exercise 32 shows how to construct k-critical graphs that have 
connectivity 2. Nevertheless, we can restrict the behavior of small vertex cut- 
sets in k-critical graphs. 


5.2.17. Definition. Let S be a set of vertices in a graph G. An S-lobe of G is 
an induced subgraph of G whose vertex set consists of S and the vertices 
of a component of G — S. 
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For every S C V(G), the yraph G is the union of its S-lobes. We use this to 
prove a statement about vertex cutsets in k-critical graphs that will be useful 
in the next theorem. Exercise 33 strengthens the result when |S| = 2. 


5.2.18. Proposition. If G is k-critical, then G has no cutset consisting of pair- 
wise adjacent vertices. In particular, if G has a cutset § = {x, y}, then 
x # y and G has an S-lobe H such that x(H + xy) =k. 


Proof: Let S be a cutset in a k-critical graph G. Let H,..., H, be the S-lobes 
of G. Since each H; is a proper subgraph of a k-critical graph, each H; is k — 1- 
colorable. If each H; has a proper k — 1-coloring giving distinct colors to the 
vertices of S, then the names of the colors in these k — 1-colorings can be per- 
muted to agree on S. The colorings then combine to form a k — 1-coloring of G, 
which is impossible. 

Hence some S-lobe H has no proper k — 1-coloring with distinct colors on 
S. This implies that S is not a clique. If S = {x, y}, then every k — 1-coloring of 
H assigns the same color to x and y, and hence H + xy is not k — 1-colorable. # 


FORCED SUBDIVISIONS 


rks 


We need not have a k-clique to have chromatic number k, but perhaps we 
must have some weakened form of a k-clique. 


P 
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5.2.19. Definition. An H-subdivision (or subdivision of H) is a graph ob- 
tained from a graph H by successive edge subdivisions (Definition 5.2.19). 
Equivalently, it is a graph obtained from H by replacing edges with pair- 
wise internally disjoint paths. 


a subdivision of K4 


5.2.20. Theorem. (Dirac [1952a]) Every graph with chromatic number at least 
4 contains a K4-subdivision. 


Proof: We use induction on n(G). 

Basis step: n(G) = 4. The graph G can only be K, itself. 

Induction step: n(G) > 4. Since x(G) > 4, we may let H bea a 4-critical 
subgraph of G. By Proposition 5.2.18, H has no cut-vertex. If «(H) = 2 and 
S = {x, y} is a cutset of size 2, then by Proposition 5.2.18 x # y and H has 
an S-lobe H’ such that x(H’+ xy) > 4. Since n(H’) < n(G), we can apply the 
induction hypothesis to obtain a K4-subdivision in H’. 
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This K4-subdivision F appears also in G unless it contains xy (see figure 
below). In this case, we modify F to obtain a K4-subdivision in G by replacing 
the edge xy with an x, y-path through another S-lobe of H. Such a path ex- 
ists because the minimality of the cutset S implies that each vertex of S has a 
neighbor in each component of H — S. 

Hence we may assume that H is 3-connected. Select a vertex x € V(G). 
Since H — x is 2-connected, it has a cycle C of length at least 3. (Let x be 
the central vertex and C the outside cycle in the figure above.) Since H is 3- 
connected, the Fan Lemma (Theorem 4.2.23) yields an x, V(C)-fan of size 3 in 
H. These three paths, together with C, form a K4-subdivision in H. a 


x) 


§.2.21.* Remark. Hajés [1961] conjectured that every k-chromatic graph con- 
tains a subdivision of K;. For k = 2, the statement says that every 2-chromatic 
graph has a nontrivial path. For k = 3, it says that every 3-chromatic graph 
has a cycle. Theorem 5.2.20 proves it for k = 4, and it is open for k € {5, 6}. 
Hajés’ Conjecture is false for k > 7% (Catlin [1979]—see Exercise 40). Had- 
wiger [1943] proposed a weaker conjecture: every k-chromatic graph has a 
subgraph that becomes K;, via edge contractions. This is weaker because a K;,- 
subdivision is a special subgraph of this:type. For k = 4, Hadwiger’s Conjecture 
is equivalent to Theorem 5.2.20. For k = 5, it is equivalent to the Four Color 
Theorem (Chapter 6). For k = 6, it was proved using the Four Color Theorem 
by Robertson, Seymour, and Thomas [1993]. For k > 7, it remains open. | 


Some results about k-critical graphs extend to the larger class of graphs 
with 6(G) > k — 1. For example, every graph with minimum degree at least 3 
has a K4-subdivision (Exercise 38); this strengthens Theorem 5.2.20. 

Dirac [1965] and Jung [1965] proved that sufficiently large chromatic num- 
ber forces a K,-subdivision in G. Mader improved this by weakening the hy- 
pothesis and generalizing the conclusion: for a simple graph F, every simple 
graph G with 5(G) > 2°) contains a subdivision of F. The threshold 2°”) is 
larger than necessary but permits a short proof. 


5.2.22.* Lemma. (Mader [1967], see Thomassen [1988]) If G is a simple graph 
with minimum degree at least 2k, then G contains disjoint subgraphs G’ 
and H such that 1) H is connected, 2) 5(G’) > k, and 3) each vertex of G’ 
has a neighbor in H. 


Proof: We may assume that G is connected. Let G - H’ denote the graph ob- 
tained from G by contracting the edges of a connected subgraph H’ and delete 
extra copies of multiple edges. In G- H’, the set V(H’) becomes a single ver- 
tex. Consider all connected subgraphs H’ of G such that G - H’ has at least 
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k(n(G) — n(H’) + 1) edges. Since 5(G) > 2k, every 1-vertex subgraph of G is 
such a subgraph. Since such subgraphs exist, we may choose H to be a maximal 
subgraph with this property. 

Let S be the set of vertices outside H with neighbors in H, and let G’ = 
G[S]. We need only show that 5(G’) > k. Each x € V(G’) has a neighbor 
y € V(A). InG-(H Uxy), the edges incident to x in G’ collapse onto edges from 
V(G’) to H that appear in G - H, and the edge xy contracts. Hence e(G - H) — 
e(G - (H Uxy)) = dg (x) + 1. By the choice of H, this difference is more than k, 
and hence 6(G’) > k. | 


5.2.23.* Theorem. (Mader [1967], see Thomassen [1988]) If F and G are sim- 
ple graphs with e(F) = m and 6(F) > 1, then 6(G) > 2” implies that G 
contains a subdivision of F. 


Proof: We use induction on m. The claim is trivial for m < 1. Consider m > 2. 
By Lemma 5.2.22, we may choose disjoint subgraphs H and G’ in G such that 
H is connected, 5(G’) > 2”~1, and every vertex of G’ has a neighbor in H. 

If F has an edge e = xy such that 5(F — e) > 1, then the induction hy- 
pothesis yields a subdivision J of F — e in G’. A path through H can be added 
between the vertices of J representing x and y to complete a subdivision of F. 

If 5(F — e) = 0 for alle € E(F), then every edge of F is incident to a leaf. 
Now F is a forest of stars, and 5(G) =o > 2m allows us to find F itself in G; 
we leave this claim to Exercise 42. | 


er 


5.2.24.* Remark. The case when F is-a complete graph remains of particular 
interest. Let f(k) be the minimum d-such that every graph with minimum 
degree at least d contains a K,- -subdivision. Theorem 5.2.23 yields f(k) < a(3), ; 
Koml6és—Szemerédi [1996] and Bolloba4s~Thomason [1998] proved that f(k) < 
ck* for some constant c (the latter shows c < 256). Since Kmm—1 has no Ko- 
subdivision when m = k(k + 1)/2 (Exercise 41), we have f (k) > k?/8. 

Exercise 38 yields f(4) = 3. Furthermore, f(5) = 6. The icosahedron 
(Exercise 7.3.8) yields f(5) > 6, since this graph is 5-regular and has no Ks5- 
subdivision. On the other hand, Mader [1998] proved Dirac’s conjecture [1964] 
that every n-vertex graph with at least 3n — 5 edges contains a Ks-subdivision. 
By the degree-sum formula, 5(G) > 6 yields at least 3n edges; hence f (5) < 6. 

Finally, we note that Scott [1997] proved a subdivision version of the 
Gyarfas-Sumner Conjecture (Remark 5.2.4) for each tree T and integer k: If G 
has with no k-clique but x (G) is sufficiently large, then G contains a subdivision 
of T as an induced subgraph. | 


EXERCISES 


5.2.1. (—) Let G be a graph such that x (G — x — y) = x(G) —2 for all pairs x, y of distinct 
vertices. Prove that G is a complete graph. (Comment: Lovasz conjectured that the 
conclusion also holds when the condition is imposed only on pairs of adjacent vertices.) 
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5.2.2. (—) Prove that a simple graph is a complete multipartite graph if and only if it 
has no 3-vertex induced subgraph with one edge. 


5.2.3. (—) The results below imply that there is no k-critical graph with k + 1 vertices. 
a) Let x and y be vertices in a k-critical graph G. Prove that N(x) C N(y) is 
impossible. Conclude that no k-critical graph has k + 1 vertices. 
b) Prove that x(G Vv H) = x(G)+ x (A), and that G v H is color-critical if and only if 
both G and H are color-critical. Conclude that C; Vv K,_3, with k + 2 vertices, is k-criti: 1. 


5.2.4. For ne N, let G be the graph with vertex set {vo,..., v3,} defined by 1; > if 
and only if |i — j| < 2 andi + j is not divisible by 6. 

a) Determine the blocks of G. 

b) Prove that adding the edge vov3, to G creates a 4-critical graph. 


5.2.5. (—) Find a subdivision of K, in the Grétzsch graph (Example 5.2.2). 


5.2.6. Determine the minimum number of edges in a connected n-vertex graph with 
chromatic number k. (Hint: Consider a k-critical subgraph.) (Ersov—Kozuhin [1962]— 
see Bhasker-Samad—West [1994] for higher connectivity.) 


5.2.7. (!) Given an optimal coloring of a k-chromatic graph, prove that for each color i 
there is a vertex with color i that is adjacent to vertices of the other k — 1 colors. 


5.2.8. Use properties of color-critical graphs to prove Proposition 5.1.14 again: x(G) < 
1 + max; min{d;, i — 1}, where d, > --- > d,-are the vertex degrees in G. 


5.2.9. (!) Prove that if G isa color-critical graph, then the graph G’ generated from it 
by applying Mycielski’s construction is also:color-critical. 


5.2.10. Given a graph G with vertex set Uy, ..., U,, let G’ be the graph generated from 
G by Mycielski’s construction. Let H be a subgraph of G. Let G” be the graph obtained 
from G’ by adding the edges {u;u;: v;v; € E(H)}. Prove that x(G”) = x(G) + 1 and that 
w(G") = max{w(G), w(A) + 1}. (Pritikin) 


5.2.11. (!) Prove that if G has no induced 2K, then x(G) < as *). (Hint: Use a 


maximum clique to define a collection of Cy) + w(G) independent sets that cover the 
vertices. Comment: This is a special case of the Gyarfas—-Sumner Conjecture—Remark 
5.2.4) (Wagon [1980]) 


§.2.12. (!) Let G; = K,. For k > 1, construct G, as follows. To the disjoint union 
Gi +--:+ G,_1, and add an independent set T of size Th: n(G;). For each choice of 
(v1, ..., Ue-1) in V(G,) x---x V(G,_1), let one vertex of T have neighborhood {v, ..., vz_1}. 
(In the sketch of G, below, neighbors are shown for only two elements of T.) 

a) Prove that w(G,) = 2 and x(G,) =k. (Zykov [1949]) 

b) Prove that G, is k-critical. (Schauble [1969]) 


G1 Ga G3 


A ho aes 
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5.2.13. (+) Let G be a k-chromatic graph with girth 6 and order n. Construct G’ as 
follows. Let T be an independent set of kn new vertices. Take (*") pairwise disjoint 
copies of G, one for each way to choose an n-set S Cc T. Add a matching between each 
copy of G and its corresponding n-set S. Prove that the resulting graph has chromatic 
number k + 1 and girth 6. (Comment: Since Cg, is 2-chromatic with girth 6, the process 
can start and these graphs exist.) (Blanche Descartes [1947, 1954]) 


G G G G 
Wi C) ('") copies 
T 


5.2.14. Chromatic number and cycle lengths. 

a) Let v be a vertex in a graph G. Among all spanning trees of G, let T be one 
that maximizes }° evg) 2r(u, v). Prove that every edge of G joins vertices belonging to 
a path in T starting at v. 

b) Prove that if x(G) > k, then G has a cycle whose length is one more than a 
multiple of k. (Hint: Use the tree T of part (a) to define a k-coloring of G.) (Tuza) 


5.2.15. (!) Prove that a triangle-free graph with n vertices is colorable with 2./n colors. 
(Comment: Thus every k-chromatic triangle-free graph has at least k?/4 vertices.) 


5.2.16. (!) Prove that every n-vertex simple graph with no r + 1-clique has at most 
(1 — 1/r)n?/2 edges. (Hint: This can be proved using Turaén’s Theorem or by induction 
on r without Turdn’s Theorem.) g 


5.2.17. (!) Let G be a simple n-vertex graph with m edges. 
a) Prove that w(G) > [n? /(n? - 2m) | and that this bound is sharp. (Hint: Use Ex- 


ercise 5.2.16. Comment: This also yields x(G) > [n? /(n? - 2m) | .) (Myers—Liu [1972]) 
b) Prove that a(G) > [n/(d + 1)], where d is the average vertex degree of G. (Hint: 
Use part (a).) (Erdés—Gallai [1961]) 


§.2.18. The Turan graph T,,,, (Example 5.2.7) is the complete r-partite graph with b par- 
tite sets of size a + 1 and r — b partite sets of size a, where a = |[n/r] andb=n-—ra. 

a) Prove that e(T,-) = (1 — 1/r)n?/2 — b(r — b)/(2r). 

b) Since e(G) must be an integer, part (a) implies e(T,,,) < Kel —1/r)n? /2\. Deter- 
mine the smallest r such that strict inequality occurs for some n. For this value of r, 
determine all n such that e(7,,,) < Ke - 1/r)n?/2 |. 


5.2.19. (+) Let a = |n/r]. Compare the Turan graph T,,, with the graph K, + Kn-, to 
prove directly that e(T,,-) = ("3*) + ¢ — 1)(“f"). 


5.2.20. Given positive integers n and k, let g = |n/k], r =n — qk, s = |n/(k+1)J, and 
t=n—s(k+1). Prove that (Z)k +rq> (5) (k+1)+ts. (Hint: Consider the complement 
of the Turan graph.) (Richter [1993]) 


§.2.21. Prove that among the n-vertex simple graphs with no r + 1-clique, the Turan 
graph 7,,, is the unique graph having the maximum number of edges. (Hint: Examine 
the proof of Theorem 5.2.9 more carefully.) 


5.2.22. A circular city with diameter four miles will get 18 cellular-phone power sta- 
tions. Each station has a transmission range of six miles. Prove that no matter where 
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in the city the stations are placed, at least two will each be able to transmit to at least 
five others. (Adapted from Bondy—Murty [1976, p115]) 


5.2.23. (!) Turdn’s proof of Turdn’s Theorem, including uniqueness (Turan [1941]). 

a) Prove that a maximal simple graph with no r + 1-clique has an r-clique. 

b) Prove that e(T,.-) = (5) + —r)@ — 1) + e(Tn-rr)- 

c) Use parts (a) and (b) to prove Turan’s Theorem by induction on n, including the 
characterization of graphs acheiving the bound. 


5.2.24. (+) Let t,(n) = e(T,,,). Let G be a graph with n vertices that has t,(n) — k edges 
and at least one r + 1-clique, where k > 0. Prove that G has at least f,(n) + 1— cliques 
of order r + 1, where f,(n) =n — [n/r] —r. (Hint: Prove that a graph with exactly one 
r + 1-clique has at most t,(n) — f,(n) edges.) (Erdés [1964], Moon [1965c]) 


5.2.25. Partial analogue of Turdn’s Theorem for Kom 

a) Prove that if G is simple and bie eViG) Co) > Ci - (5 ), then G contains Kom. 
(Hint: View Ko. as two vertices with m common neighbors.) 

b) Prove that a vig) (“9”) > e(2e/n — 1), where G has e edges. 

c) Use parts (a) and (b) to prove that a graph with more than (m — 1)¥/2n3/2 + n/4 
edges contains K2 m. 

d) Application: Given n points in the plane, prove that the distance is exactly 1 for 
at most Jqni? +n/4 pairs. (Bondy—Murty [1976, p111-112]) 


5.2.26. For n > 4, prove that every n-vertex graph with more than inn — 1 edges has 
girth at most 4. (Hint: Use the methods of ‘Exercise 5.2.25 


5.2.27. (+) For n > 6, prove that the maximum number of edges in a simple m-vertex 
graph not having two edge-disjoint cycles is n + 3. (Pésa) 


5.2.28. (+) For n > 6, prove that the maximum number of edges in a simple n-vertex 
graph not having two disjoint cycles is 3n —6. (Pésa) 


5.2.29. (!) Let G be a claw-free graph (no induced K 1,3). 

a) Prove that the subgraph induced by the union of any two color classes in a proper 
coloring of G consists of paths and even cycles. 

b) Prove that if G has a proper coloring using exactly k colors, then G has a proper 
k-coloring where the color classes differ in size by at most one. (Niessen—Kind [2000]) 


5.2.30. (+) Prove that if G has a proper coloring g in which every color class has at least 
two vertices, then G has an optimal coloring f in which every color class has at least two 
vertices. (Hint: If f has a color class with only one vertex, use g to make an alteration 
in f. The proof can be given algorithmically or by induction on x(G).) (Gallai [1963c]) 


5.2.31. Let G be a connected k-chromatic graph that is not a complete graph or a cycle of 
length congruent to 3 modulo 6. Prove that every proper k-coloring of G has two vertices 
of the same color with a common neighbor. (Tomescu) 
5.2.32. (!) The Hajés construction (Hajés [1961)). 

a) Let G and H be k-critical graphs sharing only vertex v, with vu € E(G) and 
vw € E(H). Prove that (G — vu) U (A — vw) Uuw is k-critical. 

b) For all k > 3, use part (a) to obtain a k-critical graph other than K;,. 

c) For all n > 4 except n = 5, construct a 4-critical graph with n vertices. 


Ges 
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5.2.33. Let G be a k-critical graph having a separating set S = {x, y}. By Proposition 
5.2.18, x # y. Prove that G has exactly two S-lobes and that they can be named Gj, G2 
such that G, + xy is k-critical and G2 - xy is k-critical (here Gz - xy denotes the graph 
obtained from Gz by adding xy and then contracting it). 


§.2.34. (!) Let G be a 4-critical graph having a separating set S of size 4. Prove that 
G[S] has at most four edges. (Pritikin) 


5.2.35. (+) Alternative proof that k-critical graphs are k — 1-edge-connected. 

a) Let G be a k-critical graph, with k > 3. Prove that for every e, f € E(G) there is 
a k — 1-critical subgraph of G containing e but not f. (Toft [1974]) 

b) Use part (a) and induction on k to prove Dirac’s Theorem that every k-critical 
graph is k — 1-edge-connected. (Toft [1974]) 


5.2.36. (+) Prove that if G is k-critical and every k — 1-critical subgraph of G is isomor- 
phic to K,_1, then G = K, (if k > 4) (Hint: Use Toft’s critical graph lemma—Exercise 
5.2.35a.) (Stiebitz [1985]) 


5.2.37. A graph G is vertex-color-critical if x(G — v) < x(G) for all v € V(G). 

a) Prove that every color-critical graph is vertex-color-critical. 

b) Prove that every 3-chromatic vertex-color-critical graph is color-critical. 

c) Prove that the graph below is vertex-color-critical but not color-critical. (Com- 
ment: This is not the Grétzsch graph.) 


5.2.38. (!) Prove that every simple graph with minimum degree at least 3 contains a K4- 
subdivision. (Hint: Prove a stronger result—every nontrivial simple graph with at most 
one vertex of degree less than 3 contains a K,-subdivision. The proof of Theorem 5.2.20 
already shows that every 3-connected graph contains a K4-subdivision.) (Dirac [1952a]) 


5.2.39. (!) Given that 5(G) > 3 forces a K4-subdivision in G, prove that the maximum 
number of edges in a simple n-vertex graph with no K4-subdivision is 2n — 3. 


5.2.40. Thick edges below indicate that every vertex in one circle is adjacent to every 
vertex in the other. Prove that x(G) = 7 but G has no K7-subdivision. Prove that 
x(A) = 8 but H_ has no Kg-subdivision. (Catlin [1979]) 


SS 6 
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5.2.41. Let m = k(k + 1)/2. Prove that Kmm-1 has no K2,-subdivision. 


5.2.42. (+) Let F be a forest with m edges. Let G be a simple graph such that 6(G) > m 
and n(G) > n(F). Prove that G contains F as a subgraph. (Hint: Delete one leaf from 
each nontrivial component of F to obtain F’. Let R be the set of neighbors of the deleted 
vertices. Map R onto an m-set X C V(G) that minimizes e(G[X]). Extend X to a copy of 
F’. Use Hall’s Theorem to show that X can be matched into the remaining vertices to 
complete a copy of F.) (Brandt [1994]) 


5.2.43. (+) Let G be a k-chromatic graph. It follows from Lemma 5.1.18 and Proposition 
2.1.8 that G contains every k-vertex tree as a subgraph. Strengthen this to a labeled 
analogue: if f is a proper k-coloring of G and T is a tree with vertex set {w:,..., wx}, 
then there is an adjacency-preserving map ¢: V(T) — V(G) such that f(¢(w;)) =i for 
alli. (Gyarfas—Szemerédi—Tuza [1980], Sumner [1981]) 


. §.2.44.:(+) Let G be a k-chromatic graph of girth at least 5. Prove that G contains every 
k-vertex tree as an induced subgraph. (Gydrfas—Szemerédi-Tuza [1980]) 


5.3. Enumerative Aspects 


Sometimes we can shed light on a hard problem by considering a more 
general problem. No good algorithm to test existence of a proper k-coloring is 
known (see Appendix B), but still we can study the number of proper k-colorings 
(here we fix a particular set of k colors). The chromatic number x(G) is the 
minimum k such that the count is positive; knowing the count for all k would 
tell us the chromatic number. Birkhoff [1912] introduced this counting problem 
as a possible way to attack the Four Color Problem (Section 6.3). 

In this section, we will discuss properties of the counting function, classes 
where it is easy to compute, and further related topics. 


COUNTING PROPER COLORINGS 


We start by defining the counting problem as a function of k. 


5.3.1. Definition. Given k ¢ N and a graph G, the value x(G; k) is the number 
of proper colorings f: V(G) — [k]. The set of available colors is [k] = 
{1,...,k}; the k colors need not all be used in a coloring f. Changing the 
names of the colors that are used produces a different coloring. 


5.3.2. Example. y(K,;k)) =k" and x(Kn;k)) =k(k —1)---(k—n +1). 

When coloring the vertices of K,, we can use any of the k colors at each 
vertex no matter what colors we have used at other vertices. Each of the k” 
functions from the vertex set to [k] is a proper coloring, and hence x(K,,; k) = k". 

When we color the vertices of K,,, the colors chosen earlier cannot be used on 
the ith vertex. There remain k—i+1 choices for the color of the ith vertex no mat- 
ter how the earlier colors were chosen. Hence x (K,,; k) = k(k —1)---(K —n +1). 
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We can also count this as (*)n! by first choosing n distinct colors and then mul- 
tiplying by n! to count the ways to assign the chosen colors to the vertices. For 
example, x(K3; 3) = 6 and x (K3; 4) = 24. 

The value of the product is 0 when k < n. This makes sense, since K,, has 
no proper k-colorings when k < n. | 


k k 
e 


r ° 
k k k-2 k-1 


5.3.3. Proposition. If T is a tree with n vertices, then x(T; k) = k(k — 1)""?. 


Proof: Choose some vertex v of T as a root. We can color v in k ways. If we 
extend a proper coloring to new vertices as we grow the tree from v, at each 
step only the color of the parent is forbidden, and we have k — 1 choices for the 
color of the new vertex. Furthermore, deleting a leaf shows inductively that 
every proper k-coloring arises in this way. Hence x(T; k) = k(k — 1)""}. | 
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Another way to count the colorings is to observe that the color classes of 
each proper coloring of G partition V(G) into independent sets. Grouping the 
colorings according to this partition leads to a formula for x(G;k) that is a 
polynomial in k of degree n(G). Note that this holds for the answers in Example 
5.3.2 and Proposition 5.3.3. Since every graph has this property, x(G;k) asa 
function of k is called the chromatic polynomial of G. 


5.3.4. Proposition. Let x) = x(x —1)-::(«—r+1). If p,(G) denotes the num- 
ber of partitions of V(G) into r nonempty independent sets, then x (G; k) = 
Bay Pr(G)k), which is a polynomial in k of degree n(G). 


Proof: When r colors are actually used in a proper coloring, the color classes 
partition V(G) into exactly r independent sets, which can happen in p,(G) 
ways. When k colors are available, there are exactly k,,) ways to choose colors 
and assign them to the classes. All the proper colorings arise in this way, so the 
formula for x(G; k) is correct. 

Since k,,) is a polynomial in k and p,(G) is a constant for each r, this for- 
mula implies that x (G; k) is a polynomial function of k. When G has n vertices, 
there is exactly one partition of G into n independent sets and no partition 
using more sets, so the leading term is k”. a 
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5.3.5. Example. Always p,(G) = 1, using independent sets of size 1. Also 
pi1(G) = 0 unless G has no edges, since only for K,, is the entire vertex set an 
independent set. 

Consider G = C4. There is exactly one partition into two independent sets: 
opposite vertices must be in the same independent set. When r = 3, we put 
two opposite vertices together and leave the other two in sets by themselves; 
we can do this in two ways. Thus p2 = 1, p3 = 2, pg = 1. 


x(Cark) =1-k(k-—N42-kk—1(k—-2)4+1-kk — Wk — 2)(k — 3) 
= k(k — 1)(k? — 3k +3). ry 


Computing the chromatic polynomial in this way is not generally feasible, 
since there are too many partitions to consider. There is a recursive computa- 
tion much like that used in Proposition 2.2.8 to count spanning trees. Again 
G-e denotes the graph obtained by contracting the edge e in G (Definition 2.2.7). 
Since the number of proper k-colorings is unaffected by multiple edges, we dis- 
card multiple copies of edges that arise from the contraction, keeping 
only one copy of each to form a simple graph. 


5.3.6. Theorem. (Chromatic recurrence) If G is a simple graph and e € E(G), 
then x(G; k) = x(G — ek) — x(G-e;k). 

Proof: Every proper k-coloring of G isa proper k-coloring of G — e. A proper k- 

coloring of G — e is a proper k-coloring of G if and only if it gives distinct colors 

to the endpoints u, v of e. Hence we can count the proper k-colorings of G by 

subtracting from x(G — e; k) the number of proper k-colorings of G — e that give 

u and v the same color. 

Colorings of G — e in which u and v — the same color correspond directly 
to proper k-colorings of G - e, in which the color of the contracted vertex is the 
common color of u and v. Such a coloring properly colors all the edges of G - e if 
and only if it properly colors all the edges of G other than e. | 


G G-e G-e 


5.3.7. Example. Proper k-colorings of Cy. Deleting an edge of C4 produces P,, 
while contracting an edge produces K3. Since P, is a tree and K3 is a complete 
graph, we have x(P4;k) = k(k — 1)? and x(K3;k) = k(k — 1)(k — 2). Using the 
chromatic recurrence, we obtain 


x (Ca; k) = x (Pas k) — x(K3; k) = k(k — 1)(k? — 8k +8). = 


Because both G — e and G-e have fewer edges than G, we can use the chro- 
matic recurrence inductively to compute x(G; k). We need initial conditions for 
graphs with no edges, which we have already computed: x(K,;k) =k". 
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5.3.8. Theorem. (Whitney [1933c]) The chromatic polynomial x(G;k) has 
degree n(G), with integer coefficients alternating in sign and beginning 
a —e(G), anes 
Proof: We use induction on e(G). The claims hold trivially when e(G) = 0, 
where x(K,;k) = k". For the induction step, let G be an n-vertex graph with 
e(G) > 1. Each of G —e and G-e has fewer edges than G, and G-e has n — 1 ver- 
tices. By the induction hypothesis, there are nonnegative integers {a;} and {b;} 
such that y(G —e;k) = > ino(— LD)‘ ajk" and x(G-e;k) = yg (Di bk, 
By the chromatic recurrence, 


x(G — e;k): k® — [e(G) — 1]k""? + agk?-? —.-- + (-D/ eee 
— x(G-e;k): —( Rem — by ken ® 4 wee  (—1)/°7 b; yk"! +++) 
= x(G;k): k” — e(G)k""1 + (ag tbyk™? ee + (HDG +b)" 


Hence x(G; k) is a polynomial with leading coefficient ag = 1 and next coeffi- 
cient —(a; + bo) = —e(G), and its coefficients alternate in sign. a 


5.3.9. Example. When adding an edge yields a graph whose chromatic poly- 
nomial is easy to.compute, we can use the chromatic recurrence in a different 
way. Instead of x(G; k) = x(G — e; k) — x(G-e;k), we can write x(G — e;k) = 
x(G; k) + x(G-e; k). Thus we may be able to compute x(G — e; k) using x(G; k). 

To compute x(K, — e;k), for example, we let G be K,, in this alternative 
formula and obtain 

n—3 

ik) =(k—-n+2)] [&-d. w 


i=0 


Ll - x 


We close our general discussion of x(G;k) with an explicit formula. It 
has exponentially many terms, so its uses are primarily theoretical. The for- 
mula summarizes what happens if we iterate the chromatic recurrence until 
we dispose of all the edges. 


X(Kn — €3k) = xX (Kn k) + XK 


telegré ae nat 


5.3.10. Theorem. (Whitney [1932b]) Let c(G) denote the number of compo- 
nents of a graph G. Given a set S C E(G) of edges in G, let G(S) denote 
the spanning subgraph of G with edge set S. Then the number x(G; k) of 
proper k-colorings of G is given by 


X(G; kK) = Dscee)(— Dao 


Proof: In applying the chromatic recurrence, contraction may produce multiple 
edges. We have observed that dropping these does not affect x (G; k). We claim 
that deleting extra copies of edges also does not change the claimed formula. 
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Let e and e’ be edges in G with the same endpoints. When e’ € S and 
e ¢ S, we have c(G(S U {e})) = c(G(S)), since both endpoints of e are in the 
same component of G(S). However, |S U {e}| = |S| +1. Thus the terms for S 
and S U {e} in the sum cancel. Therefore, omitting all terms for sets of edges 
containing e’ does not change the sum. This implies that we can keep or drop. 
e’ from the graph without changing the formula. 

When computing the chromatic recurrence, we therefore obtain the same 
result if we do not discard multiple edges or loops and instead retain all edges 
for contraction or deletion. Iterating the recurrence now yields 2° terms as 
we dispose of all the edges; each in turn is deleted or contracted. 

When all edges have been deleted or contracted, the graph that remains 
consists of isolated vertices. Let S be the set of edges that were contracted. The 
remaining vertices correspond to the components of G(S); each such component 
becomes one vertex when the edges of S are contracted and the other edges are 
deleted. The c(G(S)) isolated vertices at the end yield a term with k°°” col- 
orings. Furthermore, the sign of the contribution changes for each contracted 
edge, so the contribution is positive if and only if |S| is even. 

Thus the contribution when S is the set of contracted edges is (—1)!*!K°(G)), 
and this accounts for all terms in the sum. | 


S} 
a 


5.3.11. Example. A chromatic polynomial. When G is a simple graph with n 
vertices, every spanning subgraph with 0, 1, or 2 edges has n, n — 1, orn — 2 
components, respectively. When |S| =.3, the number of components is n — 2 if 
and only if the three edges form a triangle; otherwise it is n — 3. 

For example, when G is a kite (four vertices, five edges) there are ten sets 
of three edges. For two of these, G(S)-consists of a triangle plus one isolated 
vertex. The other eight sets of three edges yield spanning subgraphs with one 
component. Both types of triples are counting negatively, since |$| = 3. All 
spanning subgraphs with four or five edges have only one component. Hence 
Theorem 5.3.10 yields 


x(G; k) = k* — 5k3 + 10k? — (2k? + 8k!) + 5k —k = k* — BKB + Bk? — Ak. 


This agrees with x (G; k) = k(k—1)(k—2)(k—2), computed by counting colorings 
directly or by using x(G; k) = x(C4; k) — x(P3;k). @ 


Whitney proved Theorem 5.3.10 using the inclusion-exclusion principle of 
elementary counting. Among the universe of k-colorings, the proper colorings 
are those not assigning the same color to the endpoints of any edge. Letting A; 
be the set of k-colorings assigning the same color to the endpoints of edge e;, we 
want to count the colorings that lie in none of Aj, ..., Am (see Exercise 17). 
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Counting colorings is easy for cliques and trees (and the kite) because each 
such graph arises from K, by successively adding a vertex joined to a clique. 
The chromatic polynomial of such a graph is a product of linear factors. 


§.3.12. Definition. A vertex of G is simplicial if its neighborhood in G induces 
a clique. A simplicial elimination ordering is an ordering v,,..., v1 
for deletion of vertices so that each vertex v; is a simplicial vertex of the 
remaining graph induced by {v1,..., v;}. (These orderings are also called 
perfect elimination orderings.) 


5.3.13. Example. Chromatic polynomials from simplicial elimination order- 
ings. In a tree, a simplicial elimination ordering is a successive deletion of 
leaves. We have observed that x(G; k) = k(k — 1)""! when G is an n-vertex tree. 

When v,,..., v; is a simplicial elimination ordering for G, the product rule 
of elementary combinatorics (Appendix A) allows us to count proper k-colorings 
of G. If we have colored v;,..., v;, then when we add v; there are k—d(i) ways to 
color it, where d(i) = |N(v;) N {v1,..., v;-1}|. The factor k — d(i) is independent 
of how previous color choices were made, because the neighbors of v; that have 
been colored form a clique of size d(i) and have distinct colors. 

Deleting a simplicial vertex that starts a simplicial elimination ordering 
yields inductively that. every proper k-coloring of G arises in this way. Thus we 
have expressed the chromatic polynomial as a product of linear factors. 


In the graph below, vg, ..., v; is a simplicial elimination ordering. When we 
form the graph in the order 1}, ..., vg, the values d(1), ..., d(6) are 0, 1, 1, 2, 3, 2, 
and the chromatic polynomial is k(k — 1)(k — 1)(k — 2)(k — 3)(k — 2). a 

U4 Us U6 
V1 vg U3 


5.3.14. Remark. It is important to note that some graphs without simplicial 
elimination orderings also have chromatic polynomials that can be expressed 
as a product of linear factors of the form k — r; with 7; a nonnegative integer. 
Exercise 19 presents an example. Thus the existence of a simplicial elimination 
ordering is a sufficient but not necessary condition for the chromatic polynomial 
to have this nice factorization property. | 


Trees, cliques, near-complete graphs (K,,—e), and interval graphs (Exercise 
28) all have simplicial elimination orderings. When n > 3, the cycle C,, has no 
simplicial elimination ordering, because a cycle has no simplicial vertex to start 
the elimination. The existence of simplicial elimination orderings is equivalent 
to the absence of such cycles as induced subgraphs. 
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5.3.15. Definition. A chord of a cycle C is an edge not in C whose endpoints 
liein C. Achordless cycle in G is a cycle of length at least 4 in G that has 
no chord (that.is, the cycle is an induced subgraph). A graph G is chordal 
if it is simple and has no chordless cycle. 


The motivation for the term “chord” is geometric. If a cycle is drawn with 
its vertices in order on a circle and its chords are drawn as line segments, then 
the chords of the cycle are chords of the circle. 

It is fairly easy to show that a graph with a simplicial elimination ordering 
cannot have a chordless cycle. Thus our characterization of these graphs is 
another TONCAS theorem. We separate the substantive part of the proof of 
sufficiency as a lemma that is useful on its own (see also Laskar—Shier [1983]). 


5.3.16. Lemma. (Voloshin [1982], Farber—Jamison [1986]) For every vertex x 
in a chordal graph G, there is a simplicial vertex of G among the vertices 
farthest from x in G 


Proof: We use induction on n(G). Basis step (n(G) = 1): The one vertex in Ky 
is simplicial. 

Induction step (n(G) > 2): If x is adjacent to all other vertices, then we 
apply the induction hypothesis to the chordal graph G —x. Each simplicial 
vertex y of G — x is also simplicial in G, since x is adjacent to all of N(y) U {y}. 

Otherwise, let T be the set of vertices i in G with maximum distance from x, 
and let H be a component of G[T]. Let-S be the set of vertices in G — T having 
neighbors in V(#), and let Q be the component of G — S containing x. 


telegk 


We claim that S is a clique. Each vertex of S has a neighbor in V(H) and 
a neighbor in Q. For distinct vertices u, v € S, the union of shortest u, v-paths 
through H and through Q is a cycle of length at least 4. Since there are no 
edges from V(H) to V(Q), this cycle has no chord other than uv. Since G has 
no chordless cycle, u <> v. Since u, v € S were chosen arbitrarily, S is a clique. 

Now let G’ = G[S U V(H)]; this omits x and thus is smaller than G. We 
apply the induction hypothesis to G’ and a vertex u € S. Since S is a clique, 

— {u} © N(u). Whether G’ is a clique or not, it thus has a simplicial vertex z 
atin V(A). Since Ng(z) € V(G’), the vertex z is also simplicial in G, and z is 
a vertex with maximum distance from x, as desired. | 
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5.3.17. Theorem. (Dirac [1961]) A simple graph has a simplicial elimination 
ordering if and only if it is a chordal graph. 


Proof: Necessity. Let G be a graph with a simplicial elimination ordering. Let 
C be acycle in G of length at least 4. At the point when the elimination ordering 
first deletes a vertex of C, say v, the remaining neighbors of v form a clique. 
The clique includes the neighbors of v on C; the resulting edge joining them is 
a chord of C. Hence G has no chordless cycle. 


Sufficiency. By Lemma 5.3.16, every chordal graph has a simplicial vertex. 
This yields a simplicial elimination ordering by induction on n(G), since every 
induced subgraph of a chordal graph is a chordal graph. | 


Other properties of chordal graphs appear in Exercises 20-27. 


4 
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A HINT OF PERFECT GRAPH 
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In Proposition 5.1.16, we proved that x(G) = w(G) when G is an interval 
graph. Furthermore, every induced subgraph of an interval graph is also an 
interval graph, since we can delete the interval representing v in an interval 
representation of G to obtain an interval representation of G — v. Thus x(H) = 
w(H) holds for every induced subgraph H of an interval graph. 


5.3.18. Definition. A graph G is perfect if x(H) = w(fA) for every induced 
subgraph H C G. Equivalently, x(G[A]) = w(GI[A]) for all A C V(G). 
The clique cover number 6(G) of a graph G is the minimum number 
of cliques in G needed to cover V(G); note that 0(G) = x(G). 


Since cliques and independent sets exchange roles under complemeiitation, 
the statement of perfection for G is “a(H) = 6(#) for every induced subgraph 
H of G”. Lovasz [1972a, 1972b] proved the Perfect Graph Theorem (PGT): G 
is perfect if and only if its complement G is perfect. We prove this in Theorem 
8.1.6; here we merely illustrate perfect graphs. 


5.3.19. Definition. A family of graphs G is hereditary if every induced sub- 
graph of a graph in G is also a graph in G. 


5.3.20. Remark. In order to prove that every graph in a hereditary class G 
is perfect, it suffices to verify that x(G) = w(G) for every G € G. Doing so 
includes the proof of equality for the induced subgraphs of G. | 
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5.3.21. Example. Bipartite graphs and their line graphs. Bipartite graphs 
form a hereditary class, and y(G) = w(G) for every bipartite graph; hence 
bipartite graphs are perfect. When H is bipartite, the statement of perfection 
for H is Exercise 5.1.38 and follows from a(H) = f’(H) (Corollary 3.1.24). For 
bipartite graphs, the nontrivial a(G) = 6(G) = 8’(G) follows at once from the 
trivial ¥(G) = w(G) by the PGT. 

We briefly introduced line graphs in Definition 4.2.18 to prove the edge 
versions of Menger’s Theorem; recall that the line graph L(G) has a vertex for 
each edge of G, with e, f € V(L(G)) adjacent in L(G) if they have a common 
endpoint in G. Line graphs of bipartite graphs form a hereditary family, since 
deleting a vertex in the line graph represents deleting the corresponding edge 
in the original graph. 

Therefore, proving that a(L(G)) = 6(L(G)) when G is bipartite will show 
that complements of line graphs are perfect. A clique in L(G) (when G is bi- 
partite) consists of edges in G with a common endpoint. Thus covering the 
vertices of L(G) with cliques corresponds to selecting vertices in G to form a 
vertex cover. Independent sets in L(G) are matchings in G. Thus perfection for 
complements of line graphs of bipartite graphs amounts to the Kénig—Egervary 
Theorem (a'(G) = 8(G)) for matchings and vertex covers in bipartite graphs. 

From this the PGT yields also x(L(G)) = w(L(G)). A proper coloring of 
L(G) is a partition of E(G) into matchings, and w(L(G)) = A(G) (for bipartite 
G). Hence x(L(G)) = w(L(G)) means that the edges of a bipartite graph G can 
be partitioned into A(G) matchings. In Theorem 7.1.7, we prove directly this 
additional result of Konig [1916]. 2 | 


ram.1 


Since every interval graph is a chordal graph (Exercise 28), proving that 
all chordal graphs are perfect strengthens Proposition 5.1.16. We explore other 
characterizations of interval graphs and chordal graphs in Section 8.1. 


5.3.22. Theorem. (Berge [1960]) Chordal graphs are perfect. 


Proof: Deleting vertices cannot create chordless cycles, so the family is hered- 
itary. By Remark 5.3.20, we need only prove x(G) = w(G) when G is chordal. 

In Theorem 5.3.17, we proved that G has a simplicial elimination ordering. 
Let v1,..., U, be the reverse of such an ordering. For each i, the neighbors of 1; 
among {v1,..., v;-1} form a clique. 

We apply greedy coloring with this ordering. If v; receives color k, then 
colors 1,...,k — 1 appear on earlier neighbors of v;. Since they form a clique, 
with v; we have a clique of size k. Thus we obtain a clique whose size equals 
the number of colors used. | 


The argument of Theorem 5.3.22 shows that greedy coloring relative to the 
reverse of a simplicial elimination ordering produces an optimal coloring. This 
generalizes Proposition 5.1.16 about interval graphs. 

We present one more fundamental class of perfect graphs; it includes all 
bipartite graphs. 
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§.3.23.* Definition. A transitive orientation of a graph G is an orientation 
D such that whenever xy and yz are edges in D, also there is an edge xz in 
G that is oriented from x to z in D. A simple graph G is a comparability 
graph if it has a transitive orientation. 


5.3.24.* Example. If G is an X, Y-bigraph, then directing every edge from X to 
Y yields a transitive orientation. Thus every bipartite graph is a comparability 
graph. Transitive orientations arise from order relations; x — y could mean “x 
contains y”, which is a transitive relation. | 


§.3.25.* Proposition. (Berge [1960]) Comparability graphs are perfect. 


Proof: Every induced subdigraph of a transitive digraph is transitive, so the 
class of comparability graphs is hereditary. Thus we need only show that each 
comparability graph G is w(G)-colorable. 

Let F be a transitive orientation of G; note that F has no cycle. As shown 
in proving Theorem 5.1.21, the coloring of G that assigns to each vertex v the 
number of vertices in the longest path of F ending at v is a proper coloring. 
By transitivity, the vertices of a path in F form a clique in G. Thus we have 
x(G) < w(G). a 


books 


COUNTING ACYCLIC ORIENTATIONS (optional) 
Surprisingly, x(G; k) has meaning when k is anegative integer. An acyclic 
orientation of a graph is an orientation having no cycle. Setting k = —1 in 


x (G; k) enables us to count the acyclic orientations of G. 


5.3.26. Example. Since C, has 4 edges, it has 16 orientations. Of these, 14 
are acyclic. In Example 5.3.7, we proved that x (C4; k) = k(k — 1)(k? — 8k + 8). 
Evaluated at k = —1, this equals (—1)(—2)(7) = 14. a 


5.3.27. Theorem. (Stanley [1973]) The value of x(G;k) at k = —1 is (—1)" 
times the number of acyclic orientations of G. 


Proof: We use induction on e(G). Let a(G) be the number of acyclic orientations 
of G. When G has no edges, a(G) = 1 and x(G; —1) = (—1)", so the claim 
holds. We will prove that a(G) = a(G — e) + a(G - e) for e € E(G). If so, then 
we apply the recurrence for a, the induction hypothesis for a(G) in terms of 
x(G; k), and the chromatic recurrence te compute 


a(G) = (-1)" x (G — e; —1) + (-1I)"O-1*x (G+ e; -1) = (-1)" x (G; - 1). 


Now we prove the recurrence for a. Every acyclic orientation of G contains 
an acyclic orientation of G — e. An acyclic orientation D of G — e may extend to 
0, 1, or 2 acyclic orientations of G by orienting the edge e = uv. When D has 
no u, v-path, we can choose v > u. When D has no v, u-path, we can choose 
u — v. Since D is acyclic, D cannot have both a u, v-path and a v, u-path, so 
the two choices for e cannot both be forbidden. 
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Hence every D extends in at least one way, and a(G) equals a(G—e) plus the 
number of orientations that extend in both ways. Those extending in both ways 
are the acyclic orientations of G — e with nib u, v-path and no v, u-path. There 
are exactly a(G - e) of these, since a u, v-path or a v, u-path in an orientation of 
G — e becomes a cycle in G - e. | 


The interpretation of x(G;k) for general negative k (Exercise 32) is an 
instance of the phenomenon of “combinatorial reciprocity” (Stanley [1974]). 


EXERCISES 
Keep in mind that the notation x(G; k) may be viewed as a polynomial or as the 
number of proper k-colorings of G. 


5.3.1. (—) Compute the chromatic polynomials of the graphs below. 


5.3.2. (—) Use the chromatic recurrence 
tree with n vertices. 


5.3.3. (—) Prove that k* — 4k° + 3k? is not 


obtain the chromatic polynomial of every 


a 


chromatic polynomial. 
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5.3.4. a) Prove that x(C,;k) = (k — 1)" + (-—1)"(k — D). 
b) For H = GV K;,, prove that x(F; k) = kx(G; k — 1). From this and part (a), find 
the chromatic polynomial of the wheel C,, Vv Ky. 


§.3.5. For n = 1, let G, = P, 0 Ko; this is the graph with 2n vertices and 3n — 2 edges 
shown below. Prove that x (G,,; k) = (k? — 8k + 8)""k(k — 1). 


5.3.6. (!) Let G be a graph with n vertices. Use Proposition 5.3.4 to give a non-inductive 


proof that the coefficient of k”~! ih x (G;k) is —e(G). 


5.3.7. Prove that the chromatic polynomial of an n-vertex graph has no real root larger 
than n — 1. (Hint: Use Proposition 5.3.4.) 


5.3.8. (!) Prove that the number of proper k-colorings of a connected graph G is less 
than k(k — 1)""! if k > 3 and G is not a tree. What happens when k = 2? 


5.3.9. (!) Prove that x(G;x + y) = rere x(G[U]; x)x (GU); y). (Hint: Since both 
sides are polynomials, it suffices to prove equality when x and y are positive integers; 
do this by counting proper x + y-colorings in a different way.) 
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5.3.10. Let G be a connected graph with x(G;k) = yo (Dia kt. For 1 <i <7, 
prove that a; > (";*). (Hint: Use the chromatic recurrence.) 


5.3.11. (!) Prove that the sum of the coefficients of x(G; k) is 0 unless G has no edges. 
(Hint: When a function is a polynomial, how can one obtain the sum of the coefficients?) 


5.3.12. (+) Coefficients of x(G; k). 

a) Prove that the last nonzero term in the chromatic polynomial of G is the term 
whose exponent is the number of components of G. 

b) Use part (a) to prove that if p(k) = k" —ak""!+.--+ck’ anda > Co: then pis 
not a chromatic polynomial. (For example, this immediately implies that the polynomial 
in Exercise 5.3.3 is not a chromatic polynomial.) 


5.3.13. Let G and H be graphs, possibly overlapping. 

a) Prove that x(G UA;k) = eC ae when GM H is a complete graph. 

b) Consider two paths whose union is a cycle to show that the formula may fail 
when G/N H is not a complete graph. 

c) Apply part (a) to conclude that the chromatic number of a graph is the maximum 


of the chromatic numbers of its blocks. 


5.3.14. (!) Let P be the Petersen graph. By Brooks’ Theorem, the Petersen graph is 3- 
colorable, and hence by the pigeonhole principle it has an independent set S of size 4. 

a) Prove that P — S = 3Kge. 

b) Using part (a) and symmetry, Geuaaae the number of vertex partitions of P 
into three independent sets. 

c) In general, how can the number of sartitions into the minimum number of inde- 
pendent sets be obtained from the chromatic polynomial of G? 


5.3.15. Prove that a graph with chromaticnumber k has at most k"~* vertex partitions 
into k independent sets, with equality achieved only by K, + (n —k)K, (a k-clique plus 
n —k isolated vertices). (Hint: Use induction on n and consider the deletion of a single 
vertex.) (Tomescu [1971]) 


5.3.16. Let G be a simple graph with n vertices and m edges. Prove that G has at most 
1(7) triangles. Conclude that the coefficient of k"-? in x(G;k) is positive, unless G has 
at most one edge. (Hint: Use Theorem 5.3.10.) 


5.3.17. (*) Use the inclusion-exclusion principle to prove Theorem 5.3.10 directly. 


5.3.18. (!) Consider the chromatic polynomials of the graphs below. 

a) Without computing them, give a short proof that they are equal. 

b) Express this chromatic polynomial as the sum of the chromatic polynomials of 
two chordal graphs, and use this to give a one-line computation of it. 


5.3.19. (—) Let G be the graph obtained from Ke by subdividing one edge. Use the 
chromatic recurrence to Compute x(G;k) as a product of linear factors (factors of the 
form k — c;). Show that G is not a chordal graph. (Read [1975], Dmitriev [1980]) 
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5.3.20. Let G be a chordal graph. Use a simplicial elimination ordering of G to prove 
the following statements. 

a) G has at most n maximal cliques, with equality if and only if G has no edges. 
(Fulkerson—Gross [1965]) 

b) Every maximal clique of G containing no simplicial vertex of G is a separating set. 


5.3.21. The Szekeres-Wilf number of a graph G is 1 + maxycg46(H). Prove that a 
graph G is chordal if and only if in every induced subgraph the Szekeres—Wilf number 
equals the clique number. (Voloshin [1982]) 


5.3.22. Let k,(G) be the number of r-cliques in a connected chordal graph G. Prove that 
Dore (-D" 1k, (G) = 1. (Hint: Use induction on n(G). Note that the binomial formula 


(Appendix A) implies that }°,,.(—1/ (") = 0 when m € N,) 


5.3.23. Let S be the vertex set of a cycle in a chordal graph G. Prove that G has a 
cycle whose vertex set consists of all but one element of S. (Comment: When G has a 
spanning cycle and § Cc V(G), Hendry conjectured that G also has a cycle whose vertex 
set consists of S plus one vertex.) (Hendry [1990]) 


5.3.24. Let e be a edge of a cycle C in a chordal graph. Prove that e forms a triangle 
with a third vertex of C. 


5.3.25. Let Q be a maximal clique in a chordal graph G. Prove that if G— Q is connected, 
then Q contains a simplicial vertex. (Voloshin—Gorgos [1982]) 


5.3.26. Exercise 5.3.13 establishes the formula X(GUA;k) = sarc oreae when GQ #7 is 
a complete graph. 
a) Prove that the formula holds when GUHi is achordal graph regardless of whether 
GH is acomplete graph. 
b) Prove that if x is a vertex ina chordal graph G, then 
x(GIN(x)]; k — 1) 
x (GIN (x)]; k) 


ath 


x(G,k) = x(G- x; Bk 


(Comment: Part (b) allows the chromatic polynomial of a chordal graph to be computed 
via an arbitrary elimination ordering. For example, eliminating the central vertex of Ps 


yields x (Ps; k) = [k(k — pk" = = k(k — 1)*.) (Voloshin [1982]) 


5.3.27. (+) A minimal vertex separator in a graph G is a set S C V(G) that for some 
pair x, y is a minimal set whose deletion separates x and y. Every minimal separating 
set is a minimal vertex separator, but u, v below show that the converse need not hold. 
a) Prove that if every minimal vertex separator in G is a clique, then the same 
property holds in every induced subgraph of G. 
b) Prove that a graph G is chordal if and only if every minimal vertex separator is 
a clique. (Dirac [1961]) 


y 


5.3.28. (!) Let G be an interval graph. Prove that G is a chordal graph and that G isa 
comparability graph. 
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5.3.29. Determine the smallest imperfect graph G such that x(G) = w(G). 


5.3.30. An edge in an acyclic orientation of G is dependent if reversing it yields a cycle. 
a) Prove that every acyclic orientation of a connected n-vertex graph has at least 
n — 1 independent edges. 
b) Prove that if x(G) is less than the girth of G, then G has an orientation with no 
dependent edges. (Hint: Use the technique in the proof of Theorem 5.1.21.) 


5.3.31. («) The number a(G) of acyclic orientations of G satisfies the recurrence a(G) = 
a(G — e) + a(G - e) (Theorem 5.3.27). The number of spanning trees of G appears to 
satisfy the same recurrence; does the number of acyclic orientations of G always equal 
the number of spanning trees? Why or why not? 


5.3.32. («) Let D be an acyclic orientation of G, and let f be a coloring of V(G) from 
the set [k]. We say that (D, f) is a compatible pair if u — vin D implies f(u) < f(v). 
Let n(G; k) be the number of compatible pairs. Prove that n(G;k) = (—1)" x(G; k). 
(Stanley [1973]) 
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Chapter 6 


Planar Graphs 


6.1. Embeddings and Euler’s Formula 


Topological graph theory, broadly conceived, is the study of graph layouts. 
Initial motivation involved the famous Four Color Problem: can the regions 
of every map on a globe be colored with four colors so that regions sharing a 
nontrivial boundary have different colors? Later motivation involves circuit 
layouts on silicon chips. Wire crossings cause problems in layouts, so we ask 
which circuits have layouts without crossings. 


DRAWINGS IN THE PLANE 
The following brain teaser appeared as early as Dudeney [1917]. 


6.1.1. Example. Gas-water-electricity. Three sworn enemies A, B,C live in 
houses in the woods. We must cut paths so that each has a path to each of three 
utilities, which by tradition.are gas, water, and electricity. In order to avoid 
confrontations, we don’t want any of the paths to cross. Can this be done? This 
asks whether K3,3 can be drawn in the plane without edge crossings; we will 
give two proofs that it cannot. a 


A G B 


Arguments about drawings of graphs in the plane are based on the fact 
that every closed curve in the plane separates the plane into two regions (the 
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inside and the outside). In elementary graph theory, we take this as an intuitive 
notion, but the full details in topology are quite difficult. Before discussing a 
way to make the arguments precise for graph theory, we show informally how 
this result is used to prove impossibility for planar drawings. 


6.1.2. Proposition. K; and K3.3 cannot be drawn without crossings. 


Proof: Consider a drawing of K; or K33 in the plane. Let C be a spanning cycle. 
If the drawing does not have crossing edges, then C is drawn as a closed curve. 
Chords of C must be drawn inside or outside this curve. Two chords conflict if 
their endpoints on C occur in alternating order. When two chords conflict, we 
can draw only one inside C and one outside C. 

A 6-cycle in K33 has three pairwise conflicting chords. We can put at most 
one inside and one outside, so it is not possible to complete the embedding. 
When C is a 5-cycle in Ks, at most two chords can go inside or outside. Since 
there are five chords, again it is not possible to complete the embeddings. Hence 
neither of these graphs is planar. | 
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We need a precise notion of “drawing”. We have used curves for edges. Us- 
ing only curves formed from line segments avoids topological difficulties. These 
can approximate any curve well enough that the eye cannot tell the difference. 


6.1.3. Definition. A curve is the image of a continuous map from [0, 1] to R?. 
A polygonal curve is a curve composed of finitely many line segments. It 
is a polygonal u, v-curve when it starts at u and ends at v. 

A drawing of a graph G is a function f defined on V(G) U E(G) that 
assigns each vertex v a point f(v) in the plane and assigns each edge with 
endpoints u,v a polygonal f(u), f(v)-curve. The images of vertices are 
distinct. A pointin f(e)N f (e’) that is not acommon endpoint is 4 crossing. 


It is common to use the same name for a graph G and a particular drawing 
of G, referring to the points and curves in the drawing as the vertices and edges 
of G. Since the endpoint relation between the points and curves is the same 
as the incidence relation between the vertices and edges, the drawing can be 
viewed as a member of the isomorphism class containing G. ; 

By moving edges slightly, we can ensure that no three edges have a common 
internal point, that an edge contains no vertex except its endpoints, and that no 
two edges are tangent. If two edges cross more than once, then modifying them 
as shown below reduces the number of crossings; thus we also require that 
edges cross at most once. We consider only drawings with these properties. 
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6.1.4, Definition. A graph is planar if it has a drawing without crossings. 
Such a drawing is a planar embedding of G. A plane graph is a partic- 
ular planar embedding of a planar graph. 

A curve is closed if its first and last points are the same. It is simple 
if it has no repeated points except possibly first=last. 


A planar embedding of a graph cuts the plane into pieces. These pieces are 
fundamental objects of study. 


6.1.5. Definition. An open set in the plane is a set U € R® such that for 
every p € U, all points within some small distance from p belong to U. A 
region is an open set U that contains a polygonal u, v-curve for every pair 
u,v @U. The faces of a plane graph are the maximal regions of the plane 
that contain no point used in the embedding. 

A finite plane graph G has one unbounded face (also called the outer face). 
The faces are pairwise disjoint. Points p,q € R? lying in no edge of G are in the 
same face if and only if there is a polygonal p, q-curve that crosses no edge. 

In a plane graph, every cycle is embedded as a simple closed curve. Some 
faces lie inside it, some outside. This again relies on the fact that a simple 
closed curve cuts the plane into two regions. As we have suggested, this is not 
too difficult for polygonal curves. We present some detail of this case in order 
to explain how to compute whether a point is in the inside or the outside. This 
proof appears in Tverberg [1980]. 


6.1.6.* Theorem. (Restricted Jordan Curve Theorem) A simple closed polygo- 
nal curve C consisting of finitely many segments partitions the plane into 
exactly two faces, each having C as boundary. 


Proof: Because the list of segments is finite, nonintersecting segments cannot 
be arbitrarily close. Hence we can leave a face only by crossing C. As we follow 
C, the nearby points on our right are in a single face, and similarly for the points 
on the left. (There is a precise algebraic definition for “left” and “right” here.) 
If x ¢ C and y € C, the segment xy first intersects C somewhere, approaching 
it from the right or the left. Hence every point not along C lies in the same face 
with at least one of the two sets we have described. 

To prove that the points on the left and right lie in different faces, we 
consider rays in the plane. A ray emanating from a point p is “bad” ifit contains 
an endpoint of a segment of C. Since C has finitely many segments, there are 
finitely many bad rays from p. 
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Since the list of segments is finite, each good ray from p crosses C finitely 
often. As the direction changes, the number of crossings changes only at a 
bad direction. Before and after such a direction, the parity of the number of 
crossings is the same. We say that p is an even point when every good ray from 
p crosses C an even number of times; otherwise p is an odd point. 


p 


Given points x and y in the same face of C, let P be a polygonal x, y-curve 
that avoids C. Since C has finitely many segments, the endpoints of segments 
on P can be adjusted slightly so that the rays along segments on P are good for 
their endpoints. A segment of P belongs to a ray from one end that contains 
the other; both points have good rays in the same direction. Since the segment 
does not intersect C, the two points have the same parity. Hence every two 
points in the same face have the same parity 

Because the endpoints of a short segment intersecting C exactly once have 
opposite parity, there are two distinct faces. The even points and odd points 
form the outside face and the inside face, respectively. a 


DUAL GRAPHS 
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A map on the plane or the sphere can be viewed as a plane graph in which 
the faces are the territories, the vertices are places where boundaries meet, and 
the edges are the portions of the boundaries that join two vertices. We allow 
the full generality of loops and multiple edges. From any plane graph G, we 
can form a related plane graph called its “dual”. 


6.1.7. Definition. The dual graph G* of a plane graph G is a plane graph 
whose vertices correspond to the faces of G. The edges of G* correspond to 
the edges of G as follows: if e is an edge of G with face X on one side and 
face Y on the other side, then the endpoints of the dual edge e* < E(G*) are 
the vertices x, y of G* that represent the faces X, Y of G. The order in the 
plane of the edges incident to x € V(G*) is the order of the edges bounding 
the face X of G in a walk around its boundary. 


6.1.8. Example. Every planar embedding of K4 has four faces, and these pair- 
wise share boundary edges. Hence the dual is another copy of K4. 

Every planar embedding of the cube Q3 has eight vertices, 12 edges, and 
six faces. Opposite faces have no common boundary; the dual is a planar em- 
bedding of Ko..2, which has six vertices, 12 edges, and eight faces. 

Taking the dual can introduce loops and multiple edges. For example, let 
G be the paw, drawn below in bold edges as a plane graph. Its dual graph G* is 
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drawn in solid edges. Since G has four vertices, four edges, and two faces, G* 
has four faces, four edges, and two vertices. a 


6.1.9. Remark. 1) Example 6.1.8 shows that a simple plane graph may have 
loops and multiple edges in its dual. A cut-edge of G becomes a loop in G*, 
because the faces on both sides of it are the same. Multiple edges arise in the 
dual when distinct faces of G have more than one common boundary edge. 

2) Some arguments require more careful geometric description of the dual. 
For each face X of G, we place the dual vertex x in the interior of X, so each 
face of G contains one vertex of G*. For each edge e in the boundary of X, we 
draw a curve from x to a point on e; these do not cross. Each such curve meets 
another from the other side of e at the’same point on e to form the edge of G* 
that is dual to e. No other edges enter-X. Hence G* is a plane graph, and each 
edge of G* in this layout crosses exactly one edge of G. 

Such arguments lead to a proof that (G*)* is isomorphic to G if and only if 
G is connected (Exercise 18). Mathematicians often use the word “dual” in a 
setting when performing an operation twice returns the original object. a 


6.1.10. Example. Two embeddings of a planar graph may have nonisomorphic 
duals. Each embedding shown below has three faces, so in each case the dual 
has three vertices. In the embedding on the right, the dual vertex correspond- 
ing to the outside face has degree 4. In the embedding on the left, no dual 
vertex has degree 4, so the duals are not isomorphic. 

This does not happen with 3-connected graphs. Every 3-connected planar 
graph has essentially one embedding (see Exercise 8.2.45). | 


<4 be 


When a plane graph is connected, the boundary of each face is a closed walk. 
When the graph is not connected, there are faces whose boundary consists of 
more than one closed walk. 


6.1.11. Definition. The length of a face in a plane graph G is the total length 
of the closed walk(s) in G bounding the face. 


238 Chapter 6: Planar Graphs 


6.1.12. Example. A cut-edge belongs to the boundary of only one face, and it 
contributes twice to its length. Each graph in Example 6.1.10 has three faces. 
In the embedding on the left the lengths are 3, 6, 7; on the right they are 3, 4, 9. 
The sum of the lengths is 16 in each case, which is twice the number of edges. @ 


6.1.13. Proposition. If /(F;) denotes the length of face F; in a plane graph G, 
then 2e(G) = 0 1(F;). 


Proof: The face lengths are the degrees of the dual vertices. Since e(G) = e(G*), 
the statement 2e(G) = ) /1(F;) is thus the same as the degree-sum formula 
2e(G*) = >. dg+(x) for G*. (Both sums count each edge twice.) | 


Proposition 6.1.13 illustrates that statements about a connected plane 
graph becomes statements about the dual graph when we interchange the roles 
of vertices and faces. Edges incident to a vertex become edges bounding a face, 
and vice versa, so the roles of face lengths and vertex degrees interchange. 

We can also interpret coloring of G* in terms of G. The edges of G* repre- 
sent shared boundaries between faces of G. Hence the chromatic number of G* 
equals the number of colors needed to properly color the faces of G. Since the 
dual of the dual of a connected plane graph is the original graph, this means 
that four colors suffice to properly color the regions in every planar map if and 
only if every planar graph has chromatic number at most four. 

The Jordan Curve Theorem states that a simple closed curve cuts its in- 
terior from its exterior. In plane graphs, this duality between curve and cut 
becomes a duality between cycles and ponds. 


6.1.14. Theorem. Edges in a plane griph G form a cycle in G if and only if the 
‘corresponding dual edges form a bond in G*. 


Proof: Consider D C E(G). Suppose first that D is the edge set of a cycle in G. 
The corresponding edge set D* C E(G*) contains all dual edges joining faces 
inside D to faces outside D (the Jordan Curve Theorem implies that there is at 
least one of each). Thus D* contains an edge cut. 

If D contains a cycle and more, then D* contains an edge cut and more. 
If D contains no cycle in G, then it encloses no region (see Exercise 24a). It 
remains possible to reach the unbounded face of G from every other without 
crossing D. Hence G* — D* is connected, and D* contains no edge cut. 

Thus D* is a minimal edge cut if and only if D is a cycle. a 


The next remark yields an inductive proof of Theorem 6.1.14 (Exercise 19). 
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6.1.15. Remark. Deleting a non-cut edge of G has the effect of contracting an 
edge in G*, as two faces of G merge into one. Contracting a non-loop edge of G 
has the effect of deleting an edge in G*. Letting G be the central solid graph 
below, we have G — e on the left and G - e on the right. 

Note that to maintain this duality, we keep multiple edges and loops that 
arise from edge contraction in plane graphs. | 


Face boundaries allow us to characterize bipartite planar graphs. The 
characterization can also be proved by induction (Exercise 20). 


6.1.16. Theorem. The following are equivalent for a plane graph G. 
A) G is bipartite. 
B) Every face of G has even length. 
C) The dual graph G* is Eulerian. 


Proof: A => B. A face boundary consists of closed walks. Every odd closed walk _ 
contains an odd cycle. Therefore, in a bipartite plane graph the contributions 


to the length of faces are all even. 2 

B = A. Let C be a cycle in G. Since G has no crossings, C is laid out as 
a simple closed curve; let F be the region enclosed by C. Every region of G is 
wholly within F or wholly outside F. If;we sum the face lengths for the regions 
inside F, we obtain an even number, since each face length is even. This sum 
counts each edge of C once. It also counts each edge inside F twice, since each 
such edge belongs twice to faces in F. Hence the parity of the length of C is the 
same as the parity of the full sum, which is even. 

B  C. The dual graph G* is connected, and its vertex degrees are the face 
lengths of G. | 


Many questions we consider for general planar graphs can be answered 
rather easily for a special class of planar graphs. 


6.1.17. Definition. A graph is outerplanar if it has an embedding with every 
vertex on the boundary of the unbounded face. An outerplane graph is 
such an embedding of an outerplanar graph. 


240 Chapter 6: Planar Graphs 


The graph in Example 6.1.10 is outerplanar, but another embedding is 
needed to demonstrate this. 


6.1.18. Proposition. The boundary of the outer face a 2-connected outerplane 
graph is a spanning cycle. 

Proof: This boundary contains all the vertices. If it is not a cycle, then it passes 

through some vertex more than once. Such a vertex would be a cut-vertex. 


6.1.19. Proposition. K, and K23 are planar but not outerplanar. 


Proof: The figure below shows that K, and Ko3 are planar. 

To show that they are not outerplanar, observe that they are 2-connected. 
Thus an outerplane embedding requires a spanning cycle. There is no spanning 
cycle in K23, since it would be a cycle of length 5 in a bipartite graph. 

There is a spanning cycle in K4, but the endpoints of the remaining two 
edges alternate along it. Hence these chords conflict and cannot both be drawn 


inside. Drawing a chord outside separates a vertex from the outer face. | 
6.1.20. Proposition. Every simple outerplanar graph has a vertex of degree 


at most 2. Bb 


Proof: It suffices to prove the statement for connected graphs. We use induc- 
tion on n(G); when n(G) < 3, every vertex has degree at most 2. For n(G) > 4, 
we prove the stronger statement that G has two nonadjacent vertices of degree 
at most 2. 

Basis step (n(G) = 4): Since K, is not outerplanar, G has nonadjacent 
vertices, and two nonadjacent vertices have degree at most 2. 

Induction step (n(G) > 4): If G has a cut-vertex x, then each {x}-lobe of G 
has a vertex of degree at most 2 other than x, and these are nonadjacent in G. 

If G is 2-connected, then the outer face boundary is a cycle C. If C has 
no chords, then G is 2-regular. If xy is a chord of C, then the vertex sets of 
the two x, y-paths on C both induce outerplanar subgraphs. By the induction 
hypothesis, these subgraphs H, H’ contain vertices z, z’ of degree at most 2 that 
are not in {x, y} (this includes the case where H or H’ is K3). Since no chord of 
C can be drawn outside C or cross xy, we have z # z’. Thus z, z’ is the desired 
pair of vertices. a 


x 
x 
z z 
y 
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EULER’S FORMULA 


Euler’s Formula (n — e+ f = 2) is the basic counting tool relating vertices, 
edges, and faces in planar graphs. 


6.1.21. Theorem. (Euler [1758]): If a connected plane graph G has exactly n 
vertices, e edges, and f faces, thenn —e + f =2. 


Proof: We use induction on n. Basis step (n = 1): G is a “bouquet” of loops, 
each a closed curve in the embedding. If e = 0, then f = 1, and the formula 
holds. Each added loop passes through a face and cuts it into two faces (by the 
Jordan Curve Theorem). This augments the edge count and the face count each 
by 1. Thus the formula holds when n = 1 for any number of edges. 

Induction step (n > 1): Since G is connected, we can find an edge that is 
not a loop. When we contract such an edge, we obtain a plane graph G’ with 
n’ vertices, e’ edges, and f’ faces. The contraction does not change the number 
of faces (we merely shortened boundaries), but it reduces the number of edges 
and vertices by 1, son’ =n —1, e’ =e — 1, and f’ = f. Applying the induction 
hypothesis yields 


n-e+f=n'+1-(e 


==> 


n=1 
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6.1.22. Remark. 1) By Euler’s Formula, all planar embeddings of a connected 
graph G have the same number of faces. Although the dual may depend on the 
embedding chosen for G, the number of vertices in the dual does not. 

2) Euler’s Formula as stated fails for disconnected graphs. Ifa plane graph 
G has k components, then adding k—1 edges to G yields a connected plane graph 
without changing the number of faces. Hence Euler’s Formula generalizes for 
plane graphs with k components as n — e+ f =k +1 (for example, consider a 
graph with n vertices and no edges). | 


Euler’s Formula has many applications, particularly for simple plane 
graphs, where all faces have length at least 3. 


6.1.23. Theorem. If G is a simple planar graph with at least three vertices, 
then e(G) < 3n(G) — 6. If also G is triangle-free, then e(G) < 2n(G) — 


Proof: It suffices to consider connected graphs; otherwise we could add edges. 
Euler’s Formula will relate n(G) and e(G) if we can dispose of f. 

Proposition 6.1.13 provides an inequality between e and f. Every face 
boundary in a simple graph contains at least three edges (if n(G) > 3). Letting 
{f;} be the list of face lengths, this yields 2e = >° f; => 3f. Substituting into 
n—e+f =2yieldse < 3n—-6. 
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When G is triangle-free, the faces have length at least 4. In this case 
2e = > f; => 4f, and we obtain e < 2n — 4. o 
6.1.24. Example. Nonplanarity of Ks and K3,3 follows immediately from The- 
orem 6.1.23. For Ks, we have e = 10 > 9 = 3n — 6. Since K33 is triangle-free, 
we have e = 9 > 8 = 2n — 4. These graphs have too many edges to be planar. § 


6.1.25. Definition. A maximal planar graph is a simple planar graph that 
is not a spanning subgraph of another planar graph. A triangulation is 
a simple plane graph where every face boundary is a 3-cycle. 


6.1.26. Proposition. For a simple n-vertex plane graph G, the following are 
equivalent. 
A) G has 3n — 6 edges. 
B) G is a triangulation. 
C) G is a maximal plane graph. 


Proof: A + B. For a simple n-vertex plane graph, the proof of Theorem 6.1.23 
shows that having 3n — 6 edges is equivalent to 2e = 3f, which occurs if and 
only if every face is a 3-cycle. 

B + C. There is a face that is longer than a 3-cycle if and only if there is a 
way to add an edge to the drawing and obtain a larger simple plane graph. m& 


6.1.27. Remark. A graph embeds in the plane if and only if it embeds on a 
sphere. Given an embedding on a sphere, we can puncture the sphere inside a 
face and project the embedding onto a plane tangent to the opposite point. This 
yields a planar embedding in which the punctured face on the sphere becomes 
the unbounded face in the plane. The process is reversible. | 


6.1.28. Application. Regular polyhedra. Informally, we think of a regular 
polyhedron as a solid whose boundary consists of regular polygons of the same 
length, with the same number of faces meeting at each vertex. When we expand 
the polyhedron out to a sphere and then lay out the drawing in the plane as in 
Remark 6.1.27, we obtain a regular plane graph with faces of the same length. 
Hence the dual also is a regular graph. 

Let G be a plane graph with n vertices, e edges, and f faces. Suppose 
that G is regular of degree k and that all faces have length /. The degree-sum 
formula for G and for G* yields kn = 2e =/f. By substituting for n and f in 
Euler’s Formula, we obtain e(? —1+ 2) = 2. Since e and 2 are positive, the 
other factor must also be positive, which yields (2/k) + (2/1) > 1, and hence 
21 + 2k > kl. This inequality is equivalent to (k — 2)(J — 2) < 4. 

Because the dual of a 2-regular graph is not simple, we require that k, / > 3. 
Now (k — 2)(/ — 2) < 4 also requires k,/ < 5. The only integer pairs satisfying 
these requirements for (k, 1) are (3, 3), (3, 4), (3, 5), (4, 3), and (5, 3). 

Once we specify k and |, there is only one way to lay out the plane graph 
when we start with any face. Hence there are only the five Platonic solids listed 
below, one for each pair (k, /) that satisfying the requirements. | 
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k Ll (k-2)¢0-2) e on f name 

3 3 1 6 4 4. tetrahedron 

3 4 2 12 8 6. cube 

4 3 2 12 6 8 _ octahedron 

3 5 3 30 20 12 dodecahedron 

5 3 3 30 12 20 icosahedron 
EXERCISES 


6.1.1., (—) Prove or disprove: 
a) Every subgraph of a planar graph is planar. 
b) Every subgraph of a nonplanar graph is nonplanar. 


6.1.2. (—) Show that the graphs formed by deleting one edge from K; and K3,3 are planar. 
6.1.3. (—) Determine all r,s such that K,, is planar. 


6.1.4. (—) Determine the number of isomorphism classes of planar graphs that can be 
obtained as planar duals of the graph below 


ook 


~ 


6.1.5. (—) Prove that a plane graph has a cut-vertex if and only if its dual has a cut- 
vertex. 


6.1.6. (—) Prove that a plane graph is 2-connected if and only if for every face, the 
bounding walk is a cycle. 


6.1.7. (—) Amaximal outerplanar graph is a simple outerplanar graph that is not a 
spanning subgraph of a larger simple outerplanar graph. Let G be a maximal outerpla- 
nar graph with at least three vertices. Prove that G is 2-connected. 


6.1.8. (—) Prove that every simple planar graph has a vertex of degree at most 5. 


6.1.9. (—) Use Theorem 6.1.23 to prove that every simple planar graph with fewer than 
12 vertices has a vertex of degree at most 4. 


6.1.10. (—) Prove or disprove: There is no simple bipartite planar graph with minimum 
degree at least 4. 


6.1.11. (—) Let G be a maximal planar graph. Prove that G* is 2-edge-connected and 
3-regular. 


6.1.12. (—) Draw the five regular polyhedra as planar graphs. Show that the octahedron 
is the dual of the cube and the icosahedron is the dual of the dodecahedron. 
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6.1.13. Find a planar embedding of the graph below. 


6.1.14. Prove or disprove: For each n € N, there is a simple connected 4-regular planar 
graph with more than n vertices. 


6.1.15. Construct a 3-regular planar graph of diameter 3 with 12 vertices. (Comment: 
T. Barcume proved that no such graph has more than 12 vertices.) 


6.1.16. Let F be a figure drawn continuously in the plane without retracing any seg- 
ment, ending at the start (this can be viewed as an Eulerian graph). Prove that F can 
be drawn without allowing the pencil point.to cross what has already been drawn. For 
example, the figure below has two traversals; one crosses itself and the other does not. 
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6.1.17. Prove or disprove: If G is a 2-connected simple plane graph with minimum 
degree 3, then the dual graph 'G* is simple. 


6.1.18. Given a plane graph G, draw the dual graph G* so that each dual edge intersects 
its corresponding edge in G and no other edge. Prove the following. 

a) G* is connected. 

b) If G is connected, then each face of G* contains exactly one vertex of G. 

c) (G*)* = G if and only if G is connected. 


6.1.19. Let G be a plane graph. Use induction on e(G) to prove Theorem 6.1.14: a set 
D © EG) is a cycle in G if and only if the corresponding set D* C E(G*) is a bond in 
G*. (Hint: Contract an edge of D and‘apply Remark 6.1.15.) 


6.1.20. Prove by induction on the number of faces that a plane graph G is bipartite if 
and only if every face has even length. 


6.1.21. (!) Prove that a set of edges in a connected plane graph G forms a spanning tree 
of G if and only if the duals of the remaining edges form a spanning tree of G*. 


6.1.22. The weak dual of a plane graph G is the graph obtained from the dual G* 
by deleting the vertex for the unbounded face of G. Prove that the weak dual of an 
outerplane graph is a forest. 


6.1.23. (!) Directed plane graphs. Let G be a plane graph, and let D be an orientation of 
G. The dual D* is an orientation of G* such that when an edge of D is traversed from 
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tail to head, the dual edge in D* crosses it from right to left. For example, if the solid 
edges below are in D, then the dashed edges are in D*. 


Prove that if D is strongly connected, then D* has no cycle, and 5~(D*) = 5*(D*) = 0. 
Conclude that if D is strongly connected, then D has a face on which the edges form a 
clockwise cycle and another face on which the edges form a counterclockwise cycle. 


6.1.24. (!) Alternative proof of Euler’s Formula. 

a) Use polygonal curves (not Euler’s Formula) to prove by induction on n(G) that 
every planar embedding of a tree G has one face. 

b) Prove Euler’s Formula by induction on the number of cycles. 


6.1.25. (!) Prove that every n-vertex plane graph isomorphic to its dual has 2n — 2 edges. 
For all n > 4, construct a simple n-vertex plane graph isomorphic to its dual. 


6.1.26. Determine the maximum number of edges in a simple outerplane graph with n 
vertices, giving three proofs. 

a) By induction on n. 

b) By using Euler’s Formula. 

c) By adding a vertex in the unbounded face and using Theorem 6.1.23. 


6.1.27. Let G be a connected 3-regular plane graph in which every vertex lies on one 
face of length 4, one face of length 6, and one face of length 8. 

a) In terms of n(G), determine the number of faces of each length. 

b) Use Euler’s Formula and part (a) to:determine the number of faces of G. 


6.1.28. Let C be a closed curve bounding a:convex region in the plane. Suppose that m 
chords of C are drawn so that no three share a point and no two share an endpoint. Let 
p be the number of pairs of chords that cross. In terms of m and p, compute the number 
of segments and the number of regions formed inside C. (Alexanderson—Wetzel [1977]) 


6.1.29. Prove that the complement of a simple planar graph with at least 11 vertices is 
nonplanar. Construct a self-complementary simple planar graph with 8 vertices. 


6.1.30. (!) Let G be an n-vertex simple planar graph with girth k. Prove that G has at 
most (n — 2) os edges. Use this to prove that the Petersen graph is nonplanar. 


6.1.31. Let G be the simple graph with vertex set v)....,v, whose edges are 
{v;v;: |i — j| < 3}. Prove that G is a maximal planar graph. 


6.1.32. Let G be a maximal planar graph. Prove that if S is a separating 3-set of G’*, 
then G* — S has two components. (Chappell) 


6.1.33. (!) Let G be a triangulation, and let n; be the number of vertices of degree i in 
G. Prove that }°(6 —i)n; = 12. 


6.1.34. Construct an infinite family of simple planar graphs with minimum degree 5 
such that each has exactly 12 vertices of degree 5. (Hint: Modify the dodecahedron. ) 


6.1.35. (!) Prove that every simple planar graph with at least four vertices has at least 
four vertices with degree less than 6. For each even value of n with n > 8, construct an 
n-vertex simple planar graph G that has exactly four vertices with degree less than 6. 
(Grtinbaum—Motzkin [1963]) 
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6.1.36. Let S be a set of n points in the plane such that for all x, y € S, the distance in 
the plane between x and y is at least 1. Prove that there are at most 3n — 6 pairs u, v in 
S such that the distance in the plane between u and v is exactly 1. 


6.1.37. Given integers k > 2,/ > 1, and k/ even, construct a planar graph with exactly 
k faces in which every face has length /. 


6.2. Characterization of Planar Graphs 


Which graphs embed in the plane? We have proved that Ks; and K33 do 
not. In fact, these are the crucial graphs and lead to a characterization of pla- 
nar graphs known as Kuratowski’s Theorem. Kasimir Kuratowski once asked 
Frank Harary about the origin of the notation for K5; and K33. Harary replied, 
“The K in Ks stands for Kasimir, and the K in K33 stands for Kuratowski!” 

Recall that a subdivision of a graph is a graph obtained from it by replacing 
edges with pairwise internally-disjoint paths (Definition 5.2.19). 


a subdivision of K3 3 


6.2.1. Proposition. If a graph G has a subgraph that is a subdivision of Ks or 
K3,3, then G is nonplanar. 


Proof: Every subgraph of a planar graph is planar, so it suffices to show that 
subdivisions of Ks and K3.3 are nonplanar. Subdividing edges does not affect 
planarity; the curves in an embedding of a subdivision of G can be used to 
obtain an embedding of G, and vice versa. | 


By Proposition 6.2.1, avoiding subdivisions of K; and K33 is a necessary 
condition for being a planar graph. Kuratowski proved TONCAS: 


6.2.2. Theorem. (Kuratowski [1930]) A graph is planar if and only if it does 
not contain a subdivision of Ks or K3 3. | 


Kuratowski’s Theorem is our goal in the first half of this section, after 
which we will comment on other characterizations of planar graphs. 

When G is planar, we can seek a planar embedding with additional prop- 
erties. Wagner [1936], Fary [1948], and Stein [1951] showed that every finite 
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simple planar graph has an embedding in which all edges are straight line seg- 
ments; this is known as Fary’s Theorem (Exercise 6). For 3-connected planar 
graphs, we will prove the stronger property that there exists an embedding in 
which every face is a convex polygon. 


PREPARATION FOR KURATOWSK?’S THEOREM 


We introduce short names for subgraphs that demonstrate nonplanarity. 


6.2.3. Definition. A Kuratowski subgraph of G is a subgraph of G that is 
a subdivision of Ks or K33. A minimal nonplanar graph is a nonplanar 
graph such that every proper subgraph is planar. 


We will prove that a minimal nonplanar graph with no Kuratowski sub- 
graph must be 3-connected. Showing that every 3-connected graph with no Ku- 
ratowski subgraph is planar then completes the proof of Kuratowski’s Theorem. 


6.2.4. Lemma. If F is the edge set of a face in a planar embedding of G, then 
G has an embedding with F being the edge set of the unbounded face. 


Proof: Project the embedding onto the sphere, where the edge sets of regions 
remain the same and all regions are bounded, and then return to the plane by 
projecting from inside the face bounded by F. | 


6.2.5. Lemma. Every minimal nonplanar graph is 2-connected. 


Proof: Let G be a minimal nonplanar graph. If G is disconnected, then we 
embed one component of G inside one face of an embedding of the rest. 

If G has a cut-vertex v, let G;,..., G, be the {v}-lobes of G. By the mini- 
mality of G, each G; is planar. By Lemma 6.2.4, we can embed each G; with v 
on the outside face. We squeeze each embedding to fit in an angle smaller than 
360/k degrees at v, after which we combine the embeddings at v to obtain an 
embedding of G. a 


6.2.6. Lemma. Let S = {x, y} be a separating 2-set of G. If G is nonplanar, 
then adding the edge xy to some S-lobe of G yields a nonplanar graph. 


Proof: Let G1,..., G, be the S-lobes of G, and let H,; = G; Uxy. If H; is planar, 
then by Lemma 6.2.4 it has an embedding with xy on the outside face. For each 
i > 1, this allows H; to be attached to an embedding of Ulin H; by embedding 
H; in a face that has xy on its boundary. Afterwards, deleting the edge xy if it 
is not in G yields a planar embedding of G. | 


The next lemma allows us to restrict our attention to 3-connected graphs 
in order to prove Kuratowski’s Theorem. The hypothesized graph doesn’t exist, 
but if it did, it would be 3-connected. 
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6.2.7. Lemma. If G is a graph with fewest edges among all nonplanar graphs 
without Kuratowski subgraphs, then G is 3-connected. 


Proof: Deleting an edge of G cannot create a Kuratowski subgraph in G. The 
hypothesis thus guarantees that deleting one edge produces a planar subgraph, 
and hence G is a minimal nonplanar graph. By Lemma 6.2.5, G is 2-connected. 
Suppose that G has a separating 2-set § = {x, y}. Since G is nonplanar, 
the unioi of xy with some S-lobe is nonplanar (Lemma 6.2.6); let H be such a 
graph. Since H has fewer edges than G, the minimality of G forces H to have 
a Kuratowski subgraph F’. All of F appears in G except possibly the edge xy. 
Since S is a minimal vertex cut, both x and y have neighbors in every S- 
lobe. Thus we can replace xy in F with an x, y-path through another S-lobe 
to obtain a Kuratowski subgraph of G. This contradicts the hypothesis that G 
has no Kuratowski subgraph, so G has no separating 2-set. | 


CONVEX EMBEDDINGS 


legram.me/math_books 


To complete the proof of Kuratowski’s Theorem, it suffices to prove that 
3-connected graphs without Kuratowski subgraphs are planar. We will use 
induction. In order to facilitate the proof of the induction step, it is helpful to 
prove a stronger statement. 


6.2.8. Definition. A convex embedding of a graph is a planar embedding in 
which each face boundary is a convex polygon. 


Tutte [1960, 1963] proved that every 3-connected planar graph has a convex 
embedding. This is best possible in terms of connectivity, since for n > 4 the 2- 
connected planar graph K2,, has no convex embedding. We follow Thomassen’s 
approach to proving Kuratowski’s Theorem by proving Tutte’s stronger conclu- 
sion for 3-connected graphs without Kuratowski subgraphs. (Another proof of 
Tutte’s result is based on ear decompositions—Kelmans [2000].) 

We prove this theorem of Tutte by induction on n(G). The paradigm for 
proving conditional statements by induction (Remark 1.3.25) tells us what lem- 
mas we need. Our hypotheses are “3-connected” and “no Kuratowski subgraph”; 
our conclusion is “convex embedding”. For a graph G satisfying the hypothe- 
ses, we need to find a smaller graph G’ that satisfies both hypotheses in order 
to apply the induction hypothesis. 
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The first lemma allows us to obtain a smaller 3-connected graph G’ by 
contracting some edge in G. The second shows that G’ will also satisfy the hy- 
pothesis of having no Kuratowski subgraph. The proof will then be completed 
by obtaining a convex embedding of G from a convex embedding of G’. 


6.2.9. Lemma. (Thomassen [1980]) Every 3-connected graph G with at least 
five vertices has an edge e such that G - e is 3-connected. 


Proof: We use contradiction and extremality. Consider an edge e with end- 
points x, y. If G - e is not 3-connected, then it has a separating 2-set S. Since 
G is 3-connected, S must include the vertex obtained by shrinking e. Let z de- 
note the other vertex of S and call it the mate of the adjacent pair x, y. Note 
that {x, y, z} is a separating 3-set in G. 

Suppose that G has no edge whose contraction yields a 3-connected graph, 
so every adjacent pair has a mate. Among all the edges of G, choose e = xy 
and their mate z so that the resulting disconnected graph G — [x, y,z} has a 
component H with the largest order. Let H’ be another component of G — 
{x, y, z} (see the figure below). Since {x, y, z} is a minimal separating set, each 
of x, y,z has a neighbor in each of H, H’. Let u be a neighbor of z in H’, and let 
v be the mate of u, z. 

By the definition of “mate”, G —{z, u,v} is disconnected. However, the 
subgraph of G induced by V(H) U {x,y} is connected. Deleting v from this 
subgraph, if it occurs there, cannot disconnect it, since then G — {z, v} would 
be disconnected. Therefore, Gy(#)u.,y}— v is contained in a component of G — 
{z, u, v} that has more vertices than H ,-which contradicts the choice of x, y, z. ™ 


Next we need to show that edge contraction preserves the absence of Ku- 
ratowski subgraphs. We introduce a convenient term: the branch vertices in 
a subdivision H’ of H are the vertices of degree at least 3 in H’. 


6.2.10. Lemma. If G has no Kuratowski subgraph, then also G - e has no 
Kuratowski subgraph. 


Proof: We prove the contrapositive: If G - e contains a Kuratowski subgraph, 
then so does G. Let z be the vertex of G - e obtained by contracting e = xy. If z 
is not in H, then H itself is a Kuratowski subgraph of G. If z € V(A) but zis 
not a branch vertex of H, then we obtain a Kuratowski subgraph of G from H 
by replacing z with x or y or with the edge xy. 

Similarly, if z is a branch vertex in H and at most one edge incident to z in 


250 Chapter 6: Planar Graphs 


A is incident to x in G, then expanding z into xy lengthens that path, and y is 
the corresponding branch vertex for a Kuratowski subgraph in G. 

In the remaining case (shown below), H is a subdivision of Ks and z is a 
branch vertex, and the four edges incident to z in H consist of two incident to 
x and two incident to y in G. In this case, let u1, ug be the branch vertices of 
H that are at the other ends of the paths leaving z on edges incident to x in G, 
and let v1, ve be the branch vertices of H that are at the other ends of the paths 
leaving z on edges incident to y in G. By deleting the 11, up-path and v1, ve-path 
from H, we obtain a subdivision of K33 in G, in which y, uw, ug are the branch 
vertices for one partite set and x, v1, ve are the branch vertices of the other. 


uy v1 uy V1 
> 
v9 ug v2 ug 
Ks subdivision in G’ K33 subdivision in G 


Now we can prove Tutte’s Theorem. 


0ks 


6.2.11. Theorem. (Tutte [1960, 1963) If G is a 3-connected graph with no 
subdivision of Ks or K3,3, then G has a convex embedding in the plane with 
no three vertices on a line. 2 


Proof: (Thomassen [1980, 1981]) We use induction on n(G). 

Basis step: n(G) < 4. The only 3-connected graph with at most four ver- 
tices is K,, which has such an embedding. 

Induction step: n(G) > 5. Let e be an edge such that G -e is 3-connected, as 
guaranteed by Lemma 6.2.9. Let z be the vertex obtained by contracting e. By 
Lemma 6.2.10, G-e has no Kuratowski subgraph. By the induction hypothesis, 
we obtain a convex embedding of H = G - e with no three vertices on a line. 

In this embedding, the subgraph obtained by deleting the edges incident 
to z has a face containing z (perhaps unbounded). Since H — z is 2-connected, 
the boundary of this face is a cycle C. All neighbors of z lie on C; they may be 
neighbors in G of x or y or both, where x and y are the original endpoints of e. 

The convex embedding of H includes straight segments from z to all its 
neighbors. Let x),...,x, be the neighbors of x in cyclic order on C. [If all 
neighbors of y lie in the portion of C from x; to x;,1, then we obtain a convex 
embedding of G by putting x at z in H and putting y at a point close to z in the 
wedge formed by xx; and xx;,1, as shown in the diagrams for Case 0 below. 

If this does not occur, then either 1) y shares three neighbors u, v, w with 
x, or 2) y has neighbors u, v that alternate on C with neighbors x;, x;,1 of x. In 
Case 1, C together with xy and the edges from {x, y} to {u, v, x} form a subdi- 
vision of Ks. In Case 2, C together with the paths uyv, x;xx;.1, and xy forma 


1 
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subdivision of K3.3. Since we are considering only graphs without Kuratowski 


subgraphs, in fact Case 0 must occur. | 
IN 
Case 0 Case 1 ue 2 


Together, Lemma 6.2.7 and Theorem 6.2.11 imply Kuratowski’s Theorem 
(Theorem 6.2.2). Fary’s Theorem can be obtained separately: if a graph has a 
planar embedding, then it has a straight-line planar embedding (Exercise 6). 

For applications in computer science, we want more—a straight-line planar 
embedding in which the vertices are located at the integer points in a relatively 
small grid. Schnyder [1992] proved that every n-vertex planar graph has a 
straight-line embedding in which the vertices are located at integer points in 
the grid [n — 1] x [n — 1]. g 

Many other characterizations of planar graphs have been proved; some are 
mentioned in the exercises. We describe two additional characterizations. 


6.2.12.* Definition. A graph H isa minor of a graph G if a copy of H can be 
obtained from G by deleting and/or contracting edges of G. 


For example, Ks is a minor of the Petersen graph, although the Petersen 
graph does not contain a subdivision of Ks. 


6.2.13.* Remark. Deletions and contractions can be performed in any order, 
as long as we keep track of which edge is which. Thus the minors of G can be 
described as “contractions of subgraphs of G”. 

If G contains a subdivision of H, say H’, then H also is a minor of G, ob- 
tained by deleting the edges of G not in H’ and then contracting edges incident 
to vertices of degree 2. If H has maximum degree at most 3, then H is a minor 
of G if and only if G contains a subdivision of H (Exercise 11). 

Wagner [1937] proved that a graph G is planar if and only if neither Ks nor 
K3,3 is a minor of G. Exercise 12 obtains this from Kuratowski’s Theorem. S&S 


6.2.14.* Remark. Some characterizations are more closely related to actual 
embeddings. For example, when a 3-connected graph is drawn in the plane, 
deleting the vertex set of a facial cycle leaves a connected subgraph. 

We say that a cycle in a graph is nonseparating if its vertex set is not a sep- 
arating set. Kelmans [1980, 1981b] proved that a subdivision of a 3-connected 
graph is planar if and only if every edge e lies in exactly two nonseparating 
cycles. Kelmans [1993] surveys related material. | 
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PLANARITY TESTING (optional) 


Dirac and Schuster [1954] gave the first short proof of Kuratowski’s The- 
orem. Appearing in Harary [1969, 109-112], Bondy—Murty [1976, p153—156], 
and Chartrand—Lesniak [1986, p96—98], it uses special subgraphs of a graph. 


6.2.15. Definition. When H is a subgraph of G, an H-fragment of G is either 
1) an edge not in H whose endpoints are in H, or 
2) a component of G — V(A) together with the edges (and vertices of 
attachment) that connect it to H. 


Together with the subgraph H itself, the H-fragments form a decomposi- 
tion of G. The H-fragments are the “pieces” that must be added to an embedding 
of H to obtain an embedding of G. Historically, the term “H-bridge” was used; 
we use “H-fragment” to avoid confusion with other uses of “bridge”. 

An H-fragment differs from a V(H)-lobe because the H-fragment omits 
the edges of H. Also, an H-fragment may be a single edge not in H but joining 
vertices of H, since H need not be an induced subgraph. 

For the 3-connected case of Kuratowski’s Theorem, Dirac and Schuster 
considered a minimal nonplanar 3-connected graph G with no Kuratowski sub- 
graph. Deleting an edge e yields a planar 2-connected graph. After choosing 
a cycle C through the endpoints of e, we can add e to the embedding unless 
there is a C-fragment embedded inside C and another embedded outside C 
that “conflict” with e. As in the proof of Theorem 6.2.11, this produces a Kura- 
towski subgraph of G. Tutte used the idea of conflicting C-fragments to obtain 
another characterization of planar graphs. 


0.2.16. Definition. Let C be a cycle in a graph G. Two C-fragments A, B con- 
flict if they have three common vertices of attachment to C or if there are 
four vertices v1, v2, v3, v4 in cyclic order on C such that v1, v3 are vertices 
of attachment of A and vo, v4 are vertices of attachment of B. The con- 
flict graph of C is a graph whose vertices are the C-fragments of G, with 
conflicting C-fragments adjacent. 


Tutte [1958] proved that G is planar if and only if the conflict graph of each 
cycle in G is bipartite (Exercise 13). We used this idea in our first proof that 
Ks and K33 are nonplanar (Proposition 6.1.2); the conflict graph of a spanning 
cycle in K33 is C3, and thé conflict graph of a spanning cycle in Ks is Cs. 

Nonplanar 3-connected graphs have Kuratowski subgraphs of a special 
type. Kelmans [1984a] conjectured this extension of Kuratowski’s Theorem, 
and it was proved independently by Kelmans [1983, 1984b] and Thomassen 
[1984]: Every 3-connected nonplanar graph with at least six vertices contains 
a cycle with three pairwise crossing chords. 

Characterizations of planarity lead us to ask whether we can test quickly 
whether a graph is planar. There are linear-time algorithms due to Hopcroft 
and Tarjan [1974] and to Booth and Luecker [1976], but these are very compli- 
cated (Gould [1988, p177—185] discusses the ideas used in the Hopcroft—Tarjan 
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algorithm). A simpler earlier algorithm is not linear but runs in polynomial 
time. Due to Demoucron, Malgrange, and Pertuiset [1964], it uses H-fragments. 

The idea is that if a planar embedding of H can be extended to a planar 
embedding of G, then in that extension every H-fragment of G appears inside 
a single face of H. We build increasingly larger plane subgraphs H of G that 
can be extended to an embedding of G if G is planar. We try to enlarge H by 
making small decisions that won’t lead to trouble. 

To enlarge H, we choose a face F that can accept an H-fragment B; the 
boundary of F must contain all vertices of attachment of B. Although we do 
not know the best way to embed B in F, a single path in B between vertices of 
attachment by itself has only one way to be added across F, so we add such a 
path. The details of choosing F and B appear below. Like the other algorithms 
mentioned, this algorithm produces an embedding if G is planar. 


6.2.17. Algorithm. (Planarity Testing) 

Input: A 2-connected graph. (Since G is planar if and only if each block of G 
is planar, and Algorithm 4.1.23 computes blocks, we may assume that G is a 
block with at least three vertices.) 

Idea: Successively add paths from current fragments. Maintain the vertex 
sets forming face boundaries of the subgraph already embedded. 
Initialization: Go is an arbitary cycle in G embedded in the plane, with two 
face boundaries consisting of its vertices. 

Iteration: Having determined G;, find G;41 as follows. 

1. Determine all G;-fragments of the input block G. 

2. For each G;-fragment B, determine all faces of G; that contain all ver- 
tices of attachment of B; call this set F(B). 

3. If F(B) is empty for some B, return NONPLANAR. If |F(B)| = 1 for 
some B, select such a B. If |F(B)| > 1 for every B, select any B. 

4. Choose a path P between two vertices of attachment of the selected B. 
Embed P across a face in F(B). Call the resulting graph G;,; and update the 
list of face boundaries. 

5. If G;41 = G, return PLANAR. Otherwise, augment i and return to Step 1. 


6.2.18. Example. Consider the two graphs below (from Bondy—Murty [1976, 
p165-166]). Algorithm 6.2.17 produces a planar embedding of the graph on the 
left, but it terminates in Step 3 for the graph on the right. The cycle 12348765 


has three pairwise crossing chords: 14, 27, 36. | 
8 1 8 1 
7 2 7 2 
6 3 6 3 
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6.2.19, Theorem. (Demoucron—Malgrange—Pertuiset [1964]) Algorithm 6.2.17 
produces a planar embedding if G is planar. 


Proof: We may assume that G is 2-connected. A cycle appears as a simple 
closed curve in every planar embedding. Since we can reflect the plane, every 
embedding of a cycle in a planar graph G extends to an embedding of G. 

Hence Go extends to a planar embedding of G if G is planar. It suffices to 
show that if the plane graph G; is extendable to a planar embedding of G and 
the algorithm produces a plane graph G;,; from G;, then G;,; also is extendable 
to a planar embedding of G. Note that every G;-fragment has at least two 
vertices of attachment, since G is 2-connected, 

If some G;-fragment B has |F(B)| = 1, then there is only one face of G; that 
can contain P in an extension of G; to a planar embedding of G. The algorithm 
puts P in that face to obtain G;,1, so in this case G;,; is extendable. 

Problems can arise only if |F(B)| > 1 for all B and we select the wrong face 
in which to embed a path P from the selected fragment. Suppose that (1) we 
embed P in face f € F(B), and (2) G; can be extended to a planar embedding 
G of G in which P is inside face f’ € F(B). We modify G to show that G; can 
be extended to another embedding G’ of G in which P is inside f. This shows 
that our chcice causes no problem, and the constructed G;,1 is extendable. 

Let C be the set of vertices in the boundaries of both f and f’; this includes 
the vertices of attachment of B. We draw G’ by switching between f and /’ all 

G;-fragments that G places in f or f’ and whose vertices of attachment lie in C. 
We show this on the left below, where edges of G not present in G; are dashed. 


ceram. 17 


The change switches B and produces the desired embedding G’ unless some 
unswitched G;-fragment B conflicts with a switched fragment. Since the switch 
is symmetric in f and f’ and changes only their interiors, we may assume that 
B appears in f in G. “Conflict” means that G has some B’ in f’, which we are 
trying to move to f, such that B and B’ are adjacent in the conflict graph of /. 

Let A, A’ denote the vertex sets where B, B’ attach to the boundary of f. 
‘Since B and B’ conflict, A, A’ have three common vertices or four alternating 
vertices on the boundary of f. Since A’ C C but A ¢ C, the first possibility 
implies the second. Let x,u, y, v be the alternation, with x, y € A’ C C and 
u,v € A. We may assume that u ¢ C, as shown on the right above; if there is 
no such alternation, then B, B’ do not conflict or B can switch to f’. 

Since u ¢ C and y is between u and v on f, no other face contains both u 
and v. Thus B fails to have its vertices of attachment contained in at least two 
faces, contradicting the hypothesis that | F (B)| > 1. a 
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We can begin by checking that G has at most 3n — 6 edges, maintain ap- 
propriate lists for the face boundaries, and perforin the other operations via 
searches of linear size. Thus this algorithm runs in quadratic time. The proof 
of Kuratowski’s Theorem by Klotz [1989] also gives a quadratic algorithm to 
test planarity, and it finds a Kuratowski subgraph when G is not planar. 


EXERCISES 


6.2.1. (—) Prove that the complement of the 3-dimensional cube Q3 is nonplanar. 


6.2.2.. (—) Give three proofs that the Petersen graph is nonplanar. 
a) Using Kuratowski’s Theorem. 
b) Using Euler’s Formula and the fact that the Petersen graph has girth 5. 
c) Using the planarity-testing algorithm of Demoucron—Malgrange—Pertuiset. 


6.2.3. (—) Find a convex embedding in the plane for the graph below. 


6.2.4. (—) For each graph below, prove nonplanarity or provide a convex embedding. 


6.2.5. Determine the minimum number of edges that must be deleted from the Petersen 
graph to obtain a planar subgraph. 


6.2.6. (!) Fary’s Theorem. Let R be a region in the plane bounded by a simple polygon 
with at most five sides (simple polygon means the edges are line segments that do not 
cross). Prove there is a point x inside R that “sees” all of R, meaning that the segment 
from x to any point of R does not cross the boundary of R. Use this to prove inductively 
that every simple planar graph has a straight-line embedding. 
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6.2.7. (!) Use Kuratowski’s Theorem to prove that G is outerplanar if and only if it has 
no subgraph that is a subdivision of K, or Ke.3. (Hint: To apply Kuratowski’s Theorem, 
find an appropriate modification of G. This is much easier than trying to mimic a proof 
of Kuratowski’s Theorem.) 


6.2.8. (!) Prove that every 3-connected graph with at least six vertices that contains a 
subdivision of Ks; also contains a subdivision of K33. (Wagner [1937]) 


6.2.9. (+) For n > 5, prove that the maximum number of edges in a simple planar n- 
vertex graph not having two disjoint cycles is 2n — 1. (Comment: Compare with Exercise 
5.2.28.) (Markus [1999]) 


6.2.10. (!) Let f(n) be the maximum number of edges in a simple n-vertex graph con- 
taining no K3 3-subdivision. 

a) Given that n — 2 is divisible by 3, construct a graph to show that f(n) > 3n — 5. 

b) Prove that f(n) = 3n — 5 when n — 2 is divisible by 3 and that otherwise f(n) = 
3n — 6. (Hint: Use induction on n, invoking Exercise 6.2.8 for the 3-connected case.) 
(Thomassen [1984]) 

(Comment: Mader [1998] proved the more difficult result that 3n — 6 is the maxi- 
mum number of edges in an n-vertex simple graph with no K5-subdivision.) 


6.2.11. (!) Let H be a graph with maximum degree at most 3. Prove that a graph G 
contains a subdivision of H if and only if G contains a subgraph contractible to H. 


6.2.12. (!) Wagner [1937] proved that the following condition is necessary and sufficient 
for a graph G to be planar: neither K; nor Ks, 3 can be obtained from G by performing 
deletions and contractions of edges. 
a) Show that deletion and contraction of edges preserve planarity. Conclude from 
this that Wagner’s condition is necessary. Z 
b) Use Kuratowski’s Theorem to prove *that Wagner’s condition is sufficient. 


qath_| 


6.2.18. Prove that a graph G is planar if and only if for every cycle C in G, the conflict 
graph for C is bipartite. (Tutte [1958]) 


6.2.14. Let x and y be vertices of a planar graph G. Prove that G has a planar em- 
bedding with x and y on the same face unless G — x — y has a cycle C with x and y 
in conflicting C-fragments in G. (Hint: Use Kuratowski’s Theorem. Comment: Tutte 
proved this without Kuratowski’s Theorem and used it to prove Kuratowski’s Theorem.) 


6.2.15. Let G be a 3-connected simple plane graph containing a cycle C. Prove that C 
is the boundary of a face in G if and only if G has exactly one C-fragment. (Comment: 
Tutte [1963] proved this to obtain Whitney’s [1933b] result that 3-connected planar 
graphs have essentially only one planar embedding. See also Kelmans [1981a]) 


6.2.16. (+) Let G be an outerplanar graph with n vertices, and let P be a set of n points 
in the plane, no three of which lie on a line. The extreme points of P induce a convex 
polygon that contains the other points in its interior. 

a) Let pi, p2 be consecutive extreme points of P. Prove that there is a point p € 
P —{p, p2} such that 1) no point of P is inside p; p2p, and 2) some line / through p sep- 
arates p; from p2, meets P only at p, and has exactly i — 2 points of P on the side of / 
containing pe. 

b) Prove that G has a straight-line embedding with its vertices mapped onto P. 
(Hint: Use part (a) to prove the stronger statement that if v1, ve are two consecutive 
vertices of the unbounded face of a maximal outerplanar graph G, and p,, pe are consec- 
utive vertices of the convex hull of P, then G can be straight-line embedded on P such 
that f(v1) = pi and f (ve) = pe.) (Gritzmann—Mohar—Pach—Pollack [1989]) 
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6.3. Parameters of Planarity 


Every property and parameter we have studied for general graphs can be 
studied for planar graphs. The problem of greatest historical interest is the 
maximum chromatic number of planar graphs. We will also study parameters 
that measure how far a graph is from being a planar graph. 


COLORING OF PLANAR GRAPHS 


Because every simple n-vertex planar graph has at most 3n — 6 edges, such 
a graph has a vertex of degree at most 5. This yields an inductive proof that 
planar graphs are 6-colorable (see Exercise 2). Heawood improved the bound. 


6.3.1. Theorem. (Five Color Theorem—Heawood [1890]) Every planar graph 
is 5-colorable. 


1_ books 


Proof: We use induction on n(G). 

Basis step: n(G) < 5. All such graghs are 5-colorable. 

Induction step: n(G) > 5. The edge bound (Theorem 6.1.23) implies that 
G has a vertex v of degree at most 5. By the induction hypothesis, G — v is 
5-colorable. Let f: V(G—v) — [5] be a_proper 5-coloring of G — v. If G is not 5- 
colorable, then f assigns each color to some neighbor of v, and hence d(v) = 5. 
Let v1, v2, v3, V4, vs be the neighbors of v in clockwise order around v. Name the 
colors so that f(v;) =i. 

Let G;,; denote the subgraph of G — v induced by the vertices of colors i 
and j. Switching the two colors on any component of G;,; yields another proper 
5-coloring of G — v. If the component of G; ; containing v; does not contain v,, 
then we can switch the colors on it to remove color i from N(v). Now giving 
color i to v produces a proper 5-coloring of G. Thus G is 5-colorable unless, for 
each choice of i and j, the component of G,,; containing v; also contains v;. Let 
P, ; be a path in G;,; from v; to v;, illustrated below for (i, j) = (1, 3). 


1 3 1 


Consider the cycle C completed with P;3 by v; this separates va from v4. 
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By the Jordan Curve Theorem, the path P24 must cross C. Since G is planar, 
paths can cross only at shared vertices. The vertices of P;,.3 all have color 1 or 
3, and the vertices of Po 4 all have color 2 or 4, so they have no common vertex. 

By this contradiction, G is 5-colorable. | 


Every planar graph is 5-colorable, but are five colors ever needed? The his- 
tory of this infamous question is discussed in Aigner [1984, 1987], Ore [1967a], 
Saaty—Kainen [1977, 1986], Appel—Haken [1989], and Fritsch—Fritsch [1998]. 
The earliest known posing of the Four Coior Problem is in a letter of October 
23, 1852, from Augustus de Morgan to Sir William Hamilton. The question was 
asked by de Morgan’s student Frederick Guthrie, who later attributed it to his 
brother Francis Guthrie. It was phrased in terms of map coloring. 

The problem’s ease of statement and geometric subtleties invite fallacious 
proofs; some were published and remained unexposed for years. It does not su‘- 
fice to forbid five pairwise-adjacent regions, since there are 5-chromatic graphs 
not containing Ks (recall Mycielski’s construction, for example). 

Cayley announced the problem to the London Mathematical Society in 
1878, and Kempe [1879] published a “solution”. In 1890, Heawood published a 
refutation. Nevertheless, Kempe’s idea of alternating paths, used by Heawood 
to prove the Five Color Theorem, led eventually to a proof by Appel and Haken 
[1976, 1977, 1986] (working with Koch). A path on which the colors alternate 
between two specified cclors is a Kempe chain. 

In proving the Five Color Theorem inductively, we argued that a minimal 
counterexample contains a vertex of degree at most 5 and that a planar graph 
with such a vertex cannot be a minimal counterexample. This suggests an 
approach to the Four Color Problem;:we seek an unavoidable set of graphs 
that can’t be present! We need only consider triangulations, since every simple 
planar graph is contained in a triangulation. 


6.3.2. Definition. A configuration in a planar triangulation is a separating 
cycle C (the ring) together with the portion of the graph inside C. For the 
Four Color Problem, a set of configurations is unavoidable if a nines 
counterexample must contain a member of it. A configuration is reducible 
if a planar graph containing it cannot be a minimal counterexample. 


6.3.3. Example. An unavoidable set. We have remarked that 5(G) < 5 for 
every simple planar graph. In a triangulation, every vertex has degree at least 
3. Thus the set of three configurations below is unavoidable. 


A W 


The edges from the ring to the interior are drawn with dashes because 
a configuration (in a triangulation) is completely determined if we state the 
degrees of the vertices adjacent to the ring and delete the ring (Exercise 7). 
Thus these configurations are written as “e 3”, “e 4”, and “e5”, respectively. 
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When we say that a configuration cannot be in a minimal counterexample, 
we mean that if it appears in a triangulation G, then it can be replaced to obtain 
a triangulation G’ with fewer vertices such that every 4-coloring of G’ can be 
manipulated to obtain a 4-coloring of G. 


6.3.4. Remark. Kempe’s proof. Let us try to prove the Four Color Theorem by 
induction using the unavoidable set {e 3, e4,e5}. The approach is similar to 
Theorem 6.3.1. We can extend a 4-coloring of G — v to complete a 4-coloring of G 
unless all four colors appear on N(v). Thus “e 3” is reducible. If d(v) = 4, then 
the Kernpe-chain argument works as in Theorem 6.3.1, and “e 4” is reducible. 

Now consider “e 5”. When d(v) = 5, the restriction to triangulations im- 
plies that the repeated color on N(v) in the proper 4-coloring of G — v appears 
on nonconsecutive neighbors of v. Let v1, v2, v3, v4, vs again be the neighbors of 
v in clockwise order. In the 4-coloring f of G — v, we may assume by symmetry 
that f(vs) = 2 and that f(v;) =i for 1 <i < 4. 

Define G;,; and P;,; as in Theorem 6.3.1. We can eliminate color 1 from 
N(v) unless the chains P;3 and P;4 exist from v; to v3 and v4, respectively, as 
shown on the left below. The component H of G24 containing v2 is separated 
from v4 and vs by the cycle completed by v with P;,3. Also, the component H’ of 
Go,3 containing vs is separated from v2-and v3 by the cycle completed by v with 
P,.4. We can eliminate color 2 from N(v) by switching colors 2 and 4 in H and 
colors 2 and 3 in H’. Right? This was the final case in Kempe’s proof. 


« 1 
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The problem is that P;,3 and P;,4 can intertwine, intersecting at a vertex 
with color 1 as shown on the right above. We can make the switch in H or in 
H’, but making them both creates a pair of adjacent vertices with color2. & 


Because of this difficulty, we have not shown that “e5” is reducible, and 
we must consider larger configurations. Heesch [1969] contributed the idea of 
seeking configurations with small ring size instead of few vertices inside. It is 
not hard to show that every configuration having ring size 3 or 4 is reducible 
(Exercise 9). This is equivalent to showing that no minimal 5-chromatic trian- 
gulation has a separating cycle of length at most 4. 


6,3.5.* Example. Birkhoff [1913] pushed the idea farther. He proved that 
every configuration with ring size 5 that has more than one vertex inside is 
reducible. He also proved that the configuration with ring size 6 below, called 
the Birkhoff diamond, is reducible. 
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Proving that the Birkhoff diamond is reducible takes a full page of detailed 
analysis. One approach is to try to show that all proper 4-colorings of the ring 
extend to the interior. Although some cases can be combined, and some do 
extend, in some cases it is necessary to use Kempe chains to show that the 
coloring can be changed into one that extends. | 


The intricate analysis of this first nontrivial example suggests that we have 
barely begun. The detail remaining is enormous. From 1913 to 1950, additional 
reducible configurations were found, enough to prove that all planar graphs 
with at most 36 vertices are 4-colorable. This was slow progress. In the 1960s, 
Heesch focused attention on the size of the ring, gave heuristics for finding re- 
ducible configurations, and developed methods for generating unavoidable sets. 

The first proof used configurations. with ring size up to 14. A ring of size 13 
has 66430 distinguishable 4-colorings. Reducibility requires showing that each 
leads to a 4-coloring of the full graph. Kempe-chain arguments and partial col- 
lapsing of the configuration may be needed, so reducibility proofs are not easy. 

Appel and Haken, working with Koch, improved upon the heuristics of 
Heesch and others to restrict computer:searches to “promising” configurations. 
Using 1000 hours of computer time on three computers in 1976, they found an 
unavoidable set of 1936 reducible configurations, all with ring size at most'14. 


6.3.6. Theorem. (Four Color Theorem—Appel—Haken—Koch [1977]) Every pla- 
nar graph is 4-colorable. | 


By 1983, refinements led to an unavoidable set of 1258 reducible configu- 
rations. The proof was revisited by Robertson, Sanders, Seymour, and Thomas 
[1996], using the same approach. They reduced the rules used for producing un- 
avoidable sets to a set of 32 rules. Their simplifications yielded an unavoidable 
set of 633 reducible configurations. They made their computer code available 
on the Internet; in 1997, it would prove the Four Color Theorem on a desktop 
workstation in about three hours. 


6.3.7.* Remark. Discharging. To generate unavoidable sets, we replace the 
problem case (vertex of degree 5) by larger configurations involving a vertex of 
degree 5; this can be viewed as a more detailed case analysis for the hard case. 
Systematic rules are needed to maintain a reasonably small exhaustive set. 

In a triangulation, }' d(v) = 2e(G) = 6n — 12. We rewrite this as 12 = 
>¥>(6—d(v)) and think of 6—d(v) as a charge on vertex v. Because 12 is positive, 
some vertices must have positive charge (degree 5). The rules for replacing bad 
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‘cases involve moving the charge around; they are called discharging rules. 
Since positive charge must remain somewhere, we obtain new unavoidable sets. 
The next proposition describes the effect of the simplest discharging rule. UH 


6.3.8.* Proposition. Every planar triangulation with minimum degree 5 con- 
tains a configuration in the set below. 


5e—e 5 5 e—e 6 


Proof: Start with charge defined by 6 — d(v). The first discharging rule takes 
the charge from each vertex of positive charge (degree 5) and distributes that 
charge equally among its neighbors. 

A vertex of degree 5 or 6 now having positive charge must have a neighbor 
of degree 5. A vertex of degree 7 now having positive charge must have at least 
six neighbors of degree 5. Since G is a triangulation, this requires adjacent 
vertices of degree 5. No vertex of degree 8 or more can acquire positive charge 
from this discharging rule. 

The total charge in the graph remains 12, so some vertex v has positive 
charge. For each case of d(v), one of the specified configurations occurs. | 


ks 


Discharging methods are now being applied to attack other problems using 
computer-assisted analysis by cases. = 

The proof of the Four Color Theorem met with considerable uproar. Some 
objected in principle to the use of a computer. Others complained that the 
proof was too long to be verified. Others worried about computer error. A 
few errors were found in the original algorithms, but these were fixed (Appel— 
Haken [1986]). Those who have checked calculations by hand recognize that 
the probability of human error in a mathematical proof is much higher than 
the probability of computer error when the algorithm has been proved correct. 


th 


CROSSING NUMBER 


In the remainder of this section, we consider parameters that measure 
a graph’s deviation from planarity. One natural parameter is the number of 
planar graphs needed to form the graph; Exercises 16—20 consider this. 


6.3.9. Definition. The thickness of a graph G is the minimum number of 
planar graphs in a decomposition of G into planar graphs. 


6.3.10. Proposition. A simple graph G with n vertices and m edges has thick- 
ness at least m/(3n — 6). If G has no triangles, then it has thickness at 
least m/(2n — 4). 


Proof: By Theorem 6.1.23, the denominator is the maximum size of each planar 
subgraph. The pigeonhole principle then yields the inequality. | 
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Sometimes we simply must draw a graph in the plane, even if it is not a 
planar graph. For example, a circuit laid out on a chip corresponds to a draw- 
ing of a graph. Since wire crossings lessen performance and cause potential 
problems, we try to minimize the number of crossings. We discuss the resulting 
parameter in the remainder of this subsection. 


6.3.11. Definition. The crossing number v(G) of a graph G is the minimum 
number of crossings in a drawing of G in the plane. 


6.3.12. Example. v(Kg) = 3 and v(K3,.22) = 2. We can determine the crossing 
number of some small graphs by considering maximal planar subgraphs. Con- 
sider a drawing of G in the plane. If H is a maximal plane subgraph of this 
drawing, then every edge of G not in H crosses some edge of H, so the drawing 
has at least e(G) — e(H) crossings. If G has n vertices, then e(H) < 3n — 6. If 
also G has no triangles, then e(H) < 2n — 4. 

Since Kg has 15 edges, and planar 6-vertex graphs have at most 12 edges, 
we have v(Kg) = 3. The drawing on the left below proves equality. 

Since K3 22 has 16 edges, and planar graphs with seven vertices have at 
most 15 edges, v(K3.2.2) > 1. The best drawing we find has two crossings, as 
shown on the right below. To improve the lower bound, observe that K3,22 
contains K34. Because K3 4 is triangle-free, its planar subgraphs have at most 
2-7-4 = 10 edges, and hence v(K3,4)-> 2. Every drawing of K322 contains a 
drawing of K3,4, so v(K39.9) > v(K3,4) = 2. ] 
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6.3.13. Proposition. Let G be an n-vertex graph with m edges. If k is the 
maximum number of edges in a planar subgraph of G, then v(G) > m — k. 
Furthermore, v(G) > me — > 


Proof: Given a drawing of G in the plane, let H be a maximal subgraph of G 
whose edges do not cross in this drawing. Every edge not in H crosses at least 
one edge in H; otherwise, it could be added to H. Since H has at most k edges, 
we have at least m — k crossings between edges of H and edges of G — E(H). 

After discarding E(H), we have at least m — k edges remaining. The same 
argument yields at least (m — k) — k crossings in the drawing of the remaining 
graph. Iterating the argument yields at least )~;_,(m — ik) crossings, where 
t = |m/k]|. The value of the sum is mt — kt(t + 1)/2. 

We now write m = tkh+r,where0 <r <k-—1. We Paagdugs t=(m—r)/k 
in the value of the sum and simplify to obtain v(G) > me — 24 ee | 
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The first bound m — k in Proposition 6.3.13 is useful when G has few edges: 
the crossing number of a simple graph G is at least e(G) — 3n + 6, and when 
G is bipartite it is at least e(G) — 2n + 4. Iterating the argument improves the 
bound when e(G) is larger, but for dense graphs this lower bound is weak. 

Consider K,,, for example. Lacking an exact answer, we hope at least to 
determine the leading term in a polynomial expression for v(K,,). To indicate a 
polynomial of degree k in n with leading term an‘, we often write an‘ + O(n‘). 
This is consistent with the definition of “Big Oh” notation in Definition 3.2.3. 

Proposition 6.3.13 yields v(K,,) > qn? + O(n”), but actually v(K,) grows 
like a polynomial of degree 4. The crossing number cannot exceed (/), since we 
can place the vertices on the circumference of a circle and draw chords. For 
K,, each set of four vertices contributes exactly one crossing. Actually, this 
is the worst possible straight-line drawing of K,, since in every straight-line 
drawing, each set of four vertices contributes at most one crossing, depending 
on whether one vertex is inside the triangle formed by the other three. How 
many crossings can be saved by a better drawing? 


6.3.14. Theorem. (R. Guy [1972]) ¢n* + O(n) < v(K,) < nt + On’). 


Proof: A counting argument yields a recursive lower bound. A drawing of K,, 
with fewest crossings contains n drawings of K,_1, each obtained by deleting 
one vertex. Each subdrawing has at least v(K,_1 crossings. The total count is 
at least nv(K,_1), but each crossing in the full drawing has been counted (n — 4) 
times. We conclude that (n — 4)v(K,) = nv(Kn-1). 

From this inequality, we prove by-induction on n that v(K,) > +(1) when 
n > 5. Basis step: n = 5. The crossing number of K; is 1. Induction step: n > 5. 
Using the induction hypothesis, we compute 


n n l(n—1)(n—2)(n — 3)(n - 4) _ 1fn 
na 4’ An-1) = aS 24 =5 (4): 


The denominator of the quartic term in the lower bound can be improved 
from 120 to 80 by considering copies of Kg,,-6, which has crossing number 
6 | "8 | | 257 | (Exercise 26b). 

A better drawing lowers the upper bound from () to 4n*+ O(n). Consider 
n = 2k. Drawing K,, in the plane is equivalent to drawing it on a sphere or on 
the surface of a can. Place k vertices on the top rim of the can and k vertices on 
the bottom rim, drawing chords on the top and bottom for those k-cliques. 

The edges from top to bottom fall into k natural classes. The “class num- 
ber” is the circular separation between the top and bottom endpoints, ranging 
from | =4+*] to [45+]. We draw these edges to wind around the can as little as 
possible in passing from top to bottom, so edges in the same class don’t cross. 
We now twist the can to make the class displacements run from 1 to k. This 
makes them easier to count but doesn’t change the pairs of edges that cross. 

Crossings on the side of the can involve two vertices on the top and two on 
the bottom. For top vertices x, y and bottom vertices z, w, where xz has smaller 
positive displacement than xw, we have a crossing for x, y, z, w ifand only if the 
displacements to y, z, w are distinct positive values in increasing order. (For 


v(K,) = 
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example, this holds for x, y, z, w in the illustration, but not for x, y, z,a; the 
edge ya winds around the can.) Hence there are k(5) crossings on the side of 


the twisted can, and v(K,,) < 2(/ )+ k(5) = = gn*t + O(n). w 


6.3.15. Example. v(K,,,). The most naive drawing puts the vertices of one 
partite set on one side of a channel and the vertices of the other partite set 
on the other side, with all edges drawn straight across. This has (5) (3) cross- 
ings, but it is easy to reduce this by a factor of 4. Place the vertices of Kn, 
along two perpendicular axes. Put [n/2] vertices along the positive y-axis and 
[n/2] along the negative y-axis; similarly split the m vertices along the pos- 
itive and negative x-axis. Adding up the four types of crossings generated 
when we join every vertex on the x-axis to every vertex on the y-axis yields 
v(Kmn) < | # | ks m1| |e] [3 (Zarankiewicz [1954)]). 

This bound is conj jactured to be optimal (Guy [1969] tells the history). Kleit- 
man [1970] proved it for min{n, m} < 6. Aided by a computer search, Woodall 
[1993] extended this so that the smallest unknown cases are K7,; and Kgg9. 
From Kleitman’s result, Guy [1970] proved that v(Knn) > m1) [3/27]. 
which is not far from the upper bound (Exercise 26). r 


Another general lower bound for crossing number, conjectured in Erdés- 
Guy [1973], has an appealing geometric application. Our proof is inductive, 
generalizing the lower bound argument in Theorem 6.3.14. There is an elegant 
probabilistic proof in Exercise 8.5.1land stronger results in Pach—Toth [1997]. 


6.3.16.* Theorem. Ajtai-Chvatal- Newborn-Szemerédi [1982], Leighton [1983]) 
Let G be a simple graph. If e(G) > 4n(G), then v(G) > 4e(G) 3/n(G)?. 


Proof: Let m = e(G) and n = n(G). We use induction on n. 

Basis step: m < 5n (this anetude® all simple graphs with at most 11 ver- 
tices). Note that (a — 3) > aot? when 4 < a < 5. Letting m = an for 4 <a <5, 
we obtain v(G) > m — 3n > 4m*/n’, as desired. 

Induction step: n > 11. "Given an optimal drawing of G, each crossing ap- 
pears in n —4 of the drawings obtained by deleting a single vertex. By the induc- 
tion hypothesis, v(G — v) > 4 ay Thus (n — 4)v(G) > yo vev(G) 4 ay 

By convexity, the lower bound is always at least what results when the 
vertex degrees are all replaced by the average degree. In other words, )\(m — 
d(v))? > n(m — 2m/n)°. Also (n — 1)?(n — 4) < (n — 2)3. Thus 

343 
ee a 
(n—1)2(n—4)~ 64n 
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6.3.17.* Example. Achieving the bound. The order of magnitude in Theorem 
6.3.16 is best possible. Consider G = 5- Kom/n, where 2m is a multiple of n. The 
total number of vertices is n, and the total number of edges is asymptotic to 
nL (Bm 2 =m. Since v(K,) < grt, we have v(G) < ae 1 (2m)4 = Lm This is 
within a constant factor of the lower bound from Theorem 6.3.16. a 


We apply Theorem 6.3.16 to a problem in combinatorial geometry. Erdés 
[1946] asked how many unit distances can occur among a set of n points in the 
plane. If the points occur in a unit grid, then the graph of unit distances is the 
cartesian product of two paths, and this produces about n — O(./n) edges. By 
taking all the points of a refined grid that lie within an appropriate distance 
from the origin, Erdés obtained about n}+¢/!8!8" ynit distances. This growth 
rate is superlinear, but it is slower than n!** for each positive €. 

Erdés also proved an upper bound of O(n°/?). Since two circles of radius 1 
intersect in at most two points, the graph G of unit distances cannot contain 
Ko3. Thus each pair of points has at most two common neighbors. Since each 
vertex v is a common neighbor for its eu pairs of neighbors, >~ (“9”) < 2(5). 


Since 2e(G)/n is the average vertex degree, convexity yields ) (4%) > n(?@/”). 
Together, these inequalities yield the desired bound (Exercise 5.2.25 considers 
the edge-maximization problem in general when a biclique is forbidden). 
Using number-theoretic arguments about incidences between lines and 
points in a point set, Spencer-Szemerédi—Trotter [1984] improved the upper 
bound to O(n4/*), Székely applied Theorem 6.3.16 to give an elegant and short 
graph-theoretic proof of this bound. 
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6.3.18.* Theorem. (Spencer—Szemerédi—Trotter [1984]) There are at most 
4n*/3 pairs of points at distance 1 among a set of n points in the plane. 


Proof: (Székely [1997]) By moving points or pairs of points without reducing 
the number of pairs at distance 1, we can ensure that each point is involved in 
such a pair and that no two points have distance 1 only from each other. If any 
point now is involved in only one unit distance pair, we can rotate it around its 
mate until it is distance 1 from another point. This reduces the problem to the 
case that every point is involved in at least two such pairs. 

Let P be an optimal n-point configuration, with g unit distance pairs. We 
obtain a graph from P, not by using the unit distance pairs as edges, but rather 
by drawing a unit circle around each point. If a point in P is at distance 1 from 
k other points in P, then these poirits partition the circle into k arcs. Altogether 
we obtain 2g arcs. These are the edges of a loopless graph G. 

Since two points can appear on two (but not three) unit circles, G may have 
edges of multiplicity 2 but no larger multiplicity. We delete one copy of each 
duplicated edge to obtain a simple graph G’ with at least g edges. We may 
assume that g > 4n; otherwise the bound already holds. 

Because these arcs lie on n circles, they cannot produce many crossings; 
each pair of circles crosses at most twice. Thus our layout of G’ has at most 
2(5) crossings. By Theorem 6.3.16, G’ has at least 2,.q°/n” crossings. Together, 
these inequalities yield g < 4n*/?. 7 
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SURFACES OF HIGHER GENUS (optional) 


Instead of minimizing crossings in the plane, we could change the surface 
to avoid crossings. This is the effect of building overpasses and cloverleafs 
instead of installing traffic lights. The surface of the earth is a sphere, and for 
this discussion it is convenient to consider drawings on the sphere instead of in 
the plane. As observed in Remark 6.1.27, these settings are equivalent. 

To avoid creating boundaries in the surface, we add an overpass by cutting 
two holes in the sphere and joining the edges of the holes by a tube. By stretch- 
ing the tube and squeezing the rest of the sphere, we obtain a doughnut. 


6.3.19. Definition. A handle is < tube joining two holes cut in a surface. The 
torus is the surface obtained by adding one handle toa sphere. 


4 


S 
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The torus is topolegicaily the s as the sphere with one handle, in the 
sense that one surface can be continuously transformed into the other.t 

A large graph may have many crossings and need more handles. For any 
graph, adding enough handles to a drawing on the sphere will eliminate all 
crossings and produce an embedding. When we add some number of handles, 
it doesn’t matter how we do it, because a fundamental result of topology says 
that two surfaces obtained by adding the same number of handles to a sphere 
can be continuously deformed into each other. 


6.3.20. Definition. The genus of a surface obtained by adding handles to a 
sphere is the number of handles added; we use S, for the surface of genus 
y. The genus of a graph G is the minimum y such that G embeds on S,. 
The graphs embeddable on the surfaces of genus 0, 1, 2 are the planar, 
toroidal, and double-toroidal graphs, respectively (the surface with two 
handles is the double-torus). 


The theory of planar graphs extends in some ways to graphs embeddable on 
higher surfaces; we discuss this only briefly, for cultural interest. Drawings of 
large graphs on surfaces of large genus are hard to follow, even on the pretzel 
(S3). Locally, the surface looks like a plane sheet of paper. To draw the graph 
we want to lay the entire surface flat; to do this we must cut the surface. If we 
keep track of how the edges should be pasted back together to get the surface, 
we can describe the surface on a flat piece of paper. Consider first the torus. 


This is the source of the joke that a topologist is a person who can’t tell the difference 
between a doughnut and a coffee cup. 
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6.3.21. Example. Combinatorial description of the torus. 


=> SF] | 


Cutting the closed tube once turns it into a cylinder, and then slitting the 

length of the cylinder allows us to lay it flat as a rectangle. Labeling the edges 
of the rectangle indicates how to paste it back together. The two sides of a cut 
labeled with the same letter are “identified”. 
_ Keeping track of the identifications is important because edges of an em- 
bedding on a surface may cross such a cut. When the edge reaches one border 
of the rectangle, it is reaching one side of the imagined cut. When it crosses 
the cut, it emerges from the identical point on the other copy of this border. 
The four “corners” of the rectangle correspond to the single point on the surface 
through which both cuts pass. 

These ideas lead to nice toroidal embeddings of K5, K3.3, and K7. a 


For surfaces of higher genus, there is some flexibility in making the cuts, 
but each way takes two cuts per handle before we can lay the surface flat. 
The usual representation comes from expressing the handles as “lobes” of the 
surface, with the cuts having a common point on the hub. 


6.3.22. Example. Laying the double torus flat. Below is a polygonal represen- 
tation for the double torus. Making the cuts is equivalent to adding loops at 
a single vertex until we have a one-face embedding of a bouquet of loops. In 
general, we make 2y cuts through a single point to lay S,, flat. 

Keeping track df the borders from each cut leads to representing S, by 
a 4y-gon in which a clockwise traversal of the boundary can be described by 
reading out the cuts as we traverse them. We record a cut using the notation 
of inverses when we traverse it in the opposite order. 

Since we are following the boundary of a single face, with our left hand 
always on the wall, each edge will be followed once forward and once backward. 
For the example here, the traversal is a1 Bia; B las Boas : Ba 

Each surface S, has a layout of the form a fia; : By Fess a, By a, By 1 Other 
layouts result from other ways of making the cuts — different ways of embedding 
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a bouquet of 2y loops. For example, the double torus can also be represented 
by an octagon with boundary ofyda~!p-!y-18-1. | 


ay 

Be By 

a2 
4 
Be Bi 
ag 

6.3.23. Remark. Euler’s Formula for S,. A 2-cell is a region such that every 
closed curve in the interior can be continuously contracted to a point. A 2-cell 


embedding is an embedding where every region is a 2-cell. Euler’s Formula 
generalizes for 2-cell embeddings of connected graphs on S, (Exercise 35) as 


n—et f=2-2y. 


For example, our embedding of Kz on the torus (y = 1) has 7 vertices, 21 
edges, 14 faces, and 7 - 21+ 14 = 0. The proof of Euler’s Formula for S, is like 
the proof in the plane, except that the basis case of 1-vertex graphs needs more 
care. It requires showing that it takes 2y cuts to lay the surface flat (that is, to 
obtain a 2-cell embedding of a graph with one vertex and one face). | 


6.3.24. Lemma. Every simple n-vertex graph embedded on S, has at most 
3(n — 2+ 2y) edges. 


Proof: Exercise 35. | 


Note that K7 satisfies Lemma 6.3:24 with equality on the torus (y = 1), as 
every face in the toroidal embedding of K7 is a 3-gon. Hence K7 is a maximal 
toroidal graph. Rewriting e < 3(n —2-+ 2y) yields a lower bound on the number 
of handles we must add to obtain a surface on which G is embeddable; thus 
y(G) = 1+ (e — 3n)/6. 

Lemma 6.3.24 leads to an analogue of the Four Color Theorem for S,,. 


6.3.25. Theorem. (Heawood’s Formula—Heawood [1890]) If G is embeddable 
on S, with y > 0, then x(G) < | (7 ran ee 48y)/2|. 


Proof: Let c = (7+ ./1+ 48y)/2. It suffices to prove that every simple graph 
embeddable on S, has a vertex of degree at most c — 1; the bound on x(G) then 
follows by induction on n(G). Since x(G) < c for all graphs with at most c 
vertices, so need only consider n(G) > c. 

We use Lemma 6.3.24 to show that the average (and hence minimum) de- 
gree is at most c— 1. The second inequality below follows from y > Oandn > c. 
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Since c satisfies c? — 7c + (12 — 12y) = 0, we have c — 1 = 6 — (12 — 12y)/c, so 
the average degree satisfies the desired bound. 
2e Z 6(n —2+ 2y) 262 12 —12y 
n n c 


=c-—l. | 


The key inequality here fails when y = 0. Thus the argument is invalid 
for planar graphs, even though the formula reduces to x(G) < 4 when y = 0. 
Proving that the Heawood bound is sharp involves embedding K,, on S, with 
y = [(n — 3)(n — 4)/12]. The proof breaks into cases by the congruence class of 
n modulo 12 (K7 is the first example in the easy class). Completed in Ringel— 
Youngs [1968], it comprises the book Map Color Theorem (Ringel [1974]). 

Having considered the coloring problem on S,,, one naturally wonders which 
graphs embed on S,. Planar graphs have many characterizations, beginning 
with Kuratowski’s Theorem (Theorem 6.2.2) and Wagner’s Theorem (Exercise 
6.2.12). On any surface, embeddability is preserved by deleting or contracting 
an edge. Thus every surface has a list of “minor-minimal” obstructions to em- 
beddability. Wagner’s Theorem states that the list for the plane is {K33, Ks}; 
every nonplanar graph has one of these as a minor. 

More than 800 minimal forbidden minors are known for the torus. For each 
surface, the list is finite; this follows from the much more general statement 
below (the subdivision relation in Kuratowski’s Theorem leads to infinite lists). 
6.3.26. Theorem. (The Graph Minor ‘Theorem—Robertson—Seymour [1985]) 

In any infinite list of graphs, some graph is a minor of another. | 


This is perhaps the most difficult: theorem known in graph theory. The 
complete proof takes well over 500 pages (without computer assistance) in a se- 
ries of 20 papers stretching beyond the year 2000. It has many ramifications 
about structure of graphs and complexity of computation. The techniques in- 
volved in the proof have spawned new areas of graph theory. Some aspects of 
these techniques and their relation to the proof of the Graph Minor Theorem 
are presented in the final chapter of the text by Diestel [1997]. 


EXERCISES 


6.3.1. (—) State a polynomial-time algorithm that takes an arbitrary planar graph as 
input and produces a proper 5-coloring of the graph. 


6.3.2. (—) A graph G is k-degenerate if every subgraph of G has a vertex of degree at 
most k. Prove that every k-degenerate graph is k + 1-colorable. 


6.3.3. (—) Use the Four Color Theorem to prove that every outerplanar graph is 3- 
colorable. . 


6.3.4. (—) Determine the crossing numbers of Ko2.2.2, K4,4, and the Petersen graph. 
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6.3.5. Prove that every planar graph decomposes into two bipartite graphs. (Hedetniemi 
[1969], Mabry [1995]) 


6.3.6. Without using the Four Color Theorem, prove that every planar graph with at 
most 12 vertices is 4-colorable. Use this to prove that every planar graph with at most 
32 edges is 4-colorable. 


6.3.7. (!) Let H be a configuration in a planar triangulation (Definition 6.3.2). Let H’ 
be obtained by labeling the neighbors of the ring vertices with their degrees and then 
deleting the ring vertices. Prove that H can be retrieved from H’. 


6.3.8. Create a configuration with ring size 5 in a planar triangulation such that every 
internal vertex has degree at least five. 


6.3.9. (+) Prove that every planar configuration having ring size at most four is re- 
ducible. (Hint: The ring is a separating cycle C. Prove that if smaller triangulations are 
4-colorable, then the C-lobes of G have 4-colorings that agree on C.) (Birkhoff [1913]) 


6.3.10. Grétzsch’s Theorem [1959] (see Steinberg [1993], Thomassen [1994a]) states 
that a triangle-free planar graph G is 3-colorable. Hence a(G) > n(G)/3. Tovey— 
Steinberg [1993] proved that a(G) > n(G)/3 always. Prove that this is best possible by 
considering the family of graphs G, defined as follows: G, is the 5-cycle, with vertices 
a, Xo, X1, Yi, Z1 in order. For k > 1, G, is obtained from G,_; by adding the three vertices 
Xk» Yes 2% and the five edges x,_1X4, Xk ks VeZkvZkVk—-1s ZkXk-2- The graph G3 is shown on the 
left below. (Fraughnaugh [1985]) 


Xo xy XQ X3 
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6.3.11. Define a sequence of plane graphs as follows. Let G; be C,. For n > 1, obtain 
G, from G,_; by adding a new 4-cycle surrounding G,,_;, making each vertex of the new 
cycle also adjacent to two consecutive vertices of the previous outside face. The graph 
G3 is shown on the right above. Prove that if n is even, then every proper 4-coloring of 
G, uses each color on exactly n vertices. (Albertson) 


6.3.12. (!) Without using the Four Color Theorem, prove that every outerplanar graph 
is 3-colorable. Apply this to prove the Art Gallery Theorem: If an art gallery is laid out 
as a simple polygon with n sides, then it is possible to place |n/3] guards such that every 
point of the interior is visible to some guard. Construct a polygon that requires [n/3] 
guards. (Chvatal [1975], Fisk [1978]) 
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6.3.13. An art gallery with walls is a polygon plus some nonintersecting chords called 
“walls” that join vertices. Each interior wall has a tiny opening called a “doorway”. A 
guard in a doorway can see everything in the two neighboring rooms, but a guard not in 
a doorway cannot see.past a wall. Determine the minimum number t such that for every 
walled art gallery with n vertices, it is possible to place t guards so that every interior 
point is visible to some guard. (Hutchinson [1995], Kiindgen [1999]) 


6.3.14. (+) Prove that a maximal planar graph is 3-colorable if and only if it is Eulerian. 
(Hint: For sufficiency, use induction on n(G). Choose an appropriate pair or triple of 
adjacent vertices to replace with appropriate edges.) (Heawood [1898]) 


6.3.15. (!) Prove that the vertices of an outerplanar graph can be partitioned into two 
sets so that the subgraph induced by each set is a disjoint union of paths. (Hint: Define 
the partition using the parity of the distance from a fixed vertex.) (Akiyama—Era— 
Gervacio [1989], Goddard [1991]) 


6.3.16. (—) Prove that the 4-dimensional cube Q, is nonplanar. Decompose it into two 
isomorphic planar graphs; hence Q, has thickness 2. 


6.3.17. Prove that K, has thickness at least ewe (Hint: [=] = | = |.) Show 
that equality holds for Kg by finding a.self-complementary planar graph with 8 ver- 
tices. (Comment: The thickness equals | 27 | except that Kg and Kyo have thick- 
ness 3; Beineke—Harary [1965] for n 4 4 mod 6, and Alekseev—Gonéakov [1976] for 
n= 4 mod 6.) Z 


6.3.18. Decompose Kg into three pairwise-isomorphic planar graphs. 


6.3.19. Prove that if G has thickness 2, then x(G) < 12. Use the two graphs below to 
show that x(G) may be as large as 9 when G has thickness 2. (Sulanke) 


of 
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6.3.20. (!) When r is even and s is greater than (r — 2)?/2, prove that the thickness of 
K,, isr/2. (Beineke-Harary—Moon [1964]) 


6.3.21. Determine v(K1,2.2.2) and use it to compute v(Ke222,). 


6.3.22. Prove that K3.22 has no planar subgraph with 15 edges. Use this to give another 
proof that v(K32,2) = 2. 


6.3.23. Let M, be the graph obtained from the cycle C, by adding chords between ver- 
tices that are opposite (if n is even) or nearly opposite (if n is odd). The graph M, is 
3-regular ifn is even, 4-regular if n is odd. Determine v(M,,). (Guy—Harary [1967]) 


6.3.24. The graph P* has vertex set [n] and edge set {ij: |i — j| < k}. Prove that P? is 
a maximal planar graph. Use a planar embedding of P? to prove that v(P*) = n — 4. 
(Harary—Kainen [1993]) 


272 Chapter 6: Planar Graphs 


6.3.25. For every positive integer k, construct a graph that embeds on the torus but 
requires at least k crossings when drawn in the plane. (Hint: A single easily described 
toroidal family suffices; use Proposition 6.3.13.) 


6.3.26. (!) Use Kleitman’s computation that v(Ke,,) = 6 [2 ns | [" net | to give counting 
arguments for the following lower bounds. 
a) v(Kmn) = m@= || | 254]. (Guy [1970]) 


b) vV(K,) = = ap" + O(p ). 


6.3.27. (!) It is conjectured that v(Km,) = [3 | La | [3 | | 52 |. Suppose that this 
conjecture holds for K,,,, and that m is odd. Prove that the conjecture then holds also 
for Km+in- (Kleitman [1970]) 


6.3.28. (!) Suppose that m and n are odd. Prove that in all drawings of K,,,,, the parity 
of the number of pairs of edges that cross is the same. (We consider only drawings where 
edges cross at most once and edges sharing an endpoint do not cross.) Conclude that 
v(Kinm) is odd when m — 3 and n — 3 are divisible by 3 and even otherwise. 


6.3.29. Suppose that n is odd. Prove that in all drawings of K,,, the parity of the number 
of pairs of edges that cross is the same. Conclude that v(K,,) is even when n is congruent 
to 1 or 3 modulo 8 and is odd when n is congruent to 5 or 7 modulo 8. 


6.3.30. (!) It is known that v(C,,0C,) = (m—2)n ifm < min{5, n}. Also v(K4,0C,) = 3n. 
.a) Find drawings in the plane to establish the upper bounds. 
b) Prove that v(C30Cs3) > 2. (Hint: Find three subdivisions of K3,3 that together 
use each edge exactly twice.) 


6.3.31. Let fm= = V(Kn, n, n)- 


a) Show that 3v(Knn) < f(n) < 3(%). = 

b) Show that v(K322) = 2 and v(Ka, 
9 < v(K3.3,3) < 15. 

c) Exercise 6.3.26a shows that the lower bound in part (a) is at least (3/20)n*+ O(n). 
Improve it by using a recurrence to show that f(n) > n3(n — 1)/6. 

d) The upper bound in part (a) is 3n* + O(n*). Improve it to f(n) < 2n* + O(n). 
(Hint: One construction embeds the graph on a tetrahedron and generalizes to a con- 
struction for K;_,; another uses K, and generalizes to a construction for Kn,...n-) 


6.3.32. (*) Construct an embedding of a 3-regular nonbipartite simple graph on the 
torus so that every face has even length. 


j= 3. Show that 5 < v(K3,3,2) < 7 and 


cole ee 


6.3.33. (*) Suppose that n is at least 9 and is not a prime or twice a prime. Construct a 
6-regular toroidal graph with n vertices. 


6.3.34. (*) An embedding of a graph on a surface is regular if its faces all have the 
same length. Construct regular embeddings of K4.4, K3,¢, and K3,3 on the torus. 


6.3.35. (x) Prove Euler’s Formula for genus y: For every 2-cell embedding of a graph 
on the surface $,, the numbers of vertices, edges, and faces satisfy n —e + f = 2 — 2y. 
Conclude that an n-vertex graph embeddable on S$, has at most 3(n — 2 + 2y) edges. 


6.3.36. (*) Use Euler’s Formula for S, to prove that y(K3,3,.) =n — 2, and determine 
the value exactly for n < 3. 


6.3.37. («) For every positive integer k, use Euler’s Formula for higher surfaces to prove 
that there exists a planar graph G such that y(GO Ke) > k. 


Chapter 7 


Edges and Cycles 


7.1. Line Graphs and Edge-coloring 


Many questions about vertices have natural analogues for edges. Inde- 
pendent sets have no adjacent vertices; matchings have no “adjacent” edges. 
Vertex colorings partition vertices into independent sets; we can instead parti- 
tion edges into matchings. These pairs of problems are related via line graphs 
(Definition 4.2.18). Here we repeat the definition, emphasizing our return to 
the context in which a graph may have multiple edges. We use “line graph” and 
L(G) instead of “edge graph” because E(G) already denotes the edge set. 


7.1.1. Definition. The line graph of G, written L(G), is the simple graph 
whose vertices are the edges of G, with ef € E(L(G)) whene and f havea 
common endpoint in G. 


L(G) 
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Some questions about edges in a graph G can be phrased as questions 
about vertices in L(G). When extended to all simple graphs, the vertex question 
may be more difficult. If we can solve it, then we can answer the original 
question about edges in G by applying the vertex result to L(G). 

In Chapter 1, we studied Eulerian circuits. An Eulerian circuit in G yields 
a spanning cycle in the line graph L(G). (Exercise 7.2.10 shows that the con- 
verse need not hold!) In Section 7.2, we study spanning cycles for graphs in 
general. As discussed in Appendix B, this problem is computationally difficult. 

In Chapter 3, we studied matchings. A matching in G becomes an inde- 
pendent set in L(G). Thus a’(G) = a(L(G)), and the study of a’ for graphs is 
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the study of a for line graphs. Computing a is harder for general graphs than 
for line graphs. Section 3.1 considers this for bipartite graphs, and we describe 
the general case briefly in Appendix B. 

In Chapter 4, we studied connectivity. Menger’s Theorem gave a min-max 
relation for connectivity and internally disjoint paths in all graphs. By applying 
this theorem to an appropriate line graph, we proved the analogous min-max 
relation for edge-connectivity and edge-disjoint paths in all graphs. 

In Chapter 5, we studied vertex coloring. Coloring edges so that each color 
class is a matching amounts to proper vertex coloring of the line graph. Thus 
edge-coloring is a special case of vertex coloring and therefore potentially easier. 
We discuss edge-coloring in this section. Our main result, when stated in terms 
of vertex coloring of line graphs, is an algorithm to compute x (H) within 1 when 
H is the line graph of a simple graph. 

Thus line graphs suggest the problems of edge-coloring and spanning cy- 
cles that are discussed in this chapter. We first study these separately. In 
Section 7.3, we study their connections to each other and to planar graphs. 

In applying algorithms for line graphs, we may need to know whether G isa 
line graph. There are good algorithms to check this; they use characterizations 
of line graphs, which we postpone to the end of this section. 


EDGE-COLORINGS 
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In Example 1.1.11 that introduced vertex coloring, we needed to sched- 
ule Senate committees. Edge-coloring’problems arise when the objects being 
scheduled are pairs of underlying elements. 


7.1.2. Example. Edge-coloring of Ko,. In a league with 2n teams, we want to 
schedule games so that each pair of teams plays a game, but each team plays 
at most once a week. Since each team must play 2n — 1 others, the season lasts 
at least 2n — 1 weeks. The games of each week must form a matching. We can 
schedule the season in 2n — 1 weeks if and only if we can partition E(K2,) into 
2n —1 matchings. Since Ke, is 2n — 1-regular, these must be perfect matchings. 

The figure below describes the solution. Put one vertex in the center. Ar- 
range the other 2n — 1 vertices cyclically, viewed as congruence classes modulo 
2n — 1. As in Theorem 2.2.16, the difference between two congruence classes is 
1 if they are consecutive, 2 if there is one class between them, and so on up to 
difference n — 1. There are 2n — 1 edges with each differencei, for 1 <i <n-—1. 
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Each matching consists of one edge from each difference class plus one edge 
involving the center vertex. We show one such matching in bold. Rotating the 
picture (to obtain the solid matching) yields n new edges; again they are one 
of each length plus one to the center. The 2n — 1 rotations of the figure yield 
the desired matchings, since these matchings take distinct edges from each 
difference class and distinct edges involving the center vertex. a 


7.1.3. Definition. A k-edge-coloring of G is a labeling f: E(G) — S, where 
|S| = k (often we use S = [k]). The labels are colors; the edges of one color 
form a color class. A k-edge-coloring is proper if incident edges have dif- 
ferent labels; that is, if each color class is a matching. A graph is k-edge- 
colorable if it has a proper k-edge-coloring. The edge-chromatic num- 
ber x’(G) ofa loopless graph G is the least k such that G is k-edge-colorable. 


Chromatic index is another name for x’(G). Since edges sharing a vertex 
need different colors, x’(G) > A(G). Vizing [1964] and Gupta [1966] indepen- 
dently proved that A(G) + 1 colors suffice when G is simple; this is our main 
objective. A clique in L(G) is a set of pairwise-intersecting edges of G. When G 
is simple, such edges form a star or a triangle in G (Exercise 9). For the hered- 
itary class of line graphs of simple graphs, Vizing’s Theorem thus states that 
x(H) < w(H) + 1; thus line graphs are “almost” perfect. 

In contrast to x(G) in Chapter 5, multiple edges greatly affect x’(G). A 
graph with a loop has no p¥oper edge- coloring; the adjective “loopless” excludes 
loops but allows multiple edges. 
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7.1.4. Definition. In a graph G with multiple edges, we say that a vertex pair 
x, y is an edge of multiplicity m if there are m edges with endpoints x, y. 
We write (xy) for the multiplicity of the pair, and we write (G) for the 
maximum of the edge multiplicities in G. 


7.1.5. Example. The “Fat Triangle”. For loopless graphs with multiple edges, 
x'(G) may exceed A(G) +1. Shannon [1949] proved that the maximum of x’(G) 
in terms of A(G) alone is 3A(G)/2 (see Theorem 7.1.13). Vizing and Gupta 
proved that x’(G) < A(G) + 4(G), where y.(G) is the maximum edge multiplic- 
ity. The graph below achieves both bounds. The edges are pairwise intersecting 
and hence require distinct colors. Thus x’(G) = 3A(G)/2 = A(G) + “(G). | 


7.1.6. Remark. We have observed that always x'(G) > A(G). The upper 
bound x’(G) < 2A(G) — 1 also follows easily. Color the edges in some order, 
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always assigning the current edge the least-indexed color different from those 
already appearing on edges incident to it. Since no edge is incident to more 
than 2(A(G) — 1) other edges, this never uses more than 2A(G) — 1 colors. The 
procedure is precisely greedy coloring for vertices of L(G). 


x'(G) = x(L(G)) < A(L(G)) +1 < 2A(G) — 1. o 


For bipartite graphs, the results of Chapter 3 improve the upper bound of 
Remark 7.1.6, achieving the trivial lower bound even when multiple edges are 
allowed. Furthermore, there is a good algorithm to praduce a proper A(G)- 
edge-coloring in a bipartite graph G. 


7.1.7. Theorem. (Konig [1916]) If G is bipartite, then x’(G) = A(G). 


Proof: Corollary 3.1.13 states that every regular bipartite graph H has a 1- 
factor. By induction on A(#), this yields a proper A(H)-edge-coloring. It there- 
fore suffices to show that for every bipartite graph G with maximum degree k, 
there is a k-regular bipartite graph H containing G. 

To construct such a graph, first add vertices to the smaller partite set of G, 
if necessary, to equalize the sizes. If the resulting G’ is not regular, then each 
partite set has a vertex with degree less than k. Add an edge with these two 
vertices as endpoints. Continue adding such edges until the graph becomes 
k-regular; the resulting graph is H. | 
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7.1.8. Definition. A decomposition of a regular graph G into 1-factors is a 
1-factorization of G. A graph with a 1-factorization is 1-factorable. 


An odd cycle is not 1-factorable; x’(Cam+1) = 3 > A(Com+41). The Petersen 
graph also requires an extra color, but only. one extra color. 


7.1.9. Example. The Petersen graph is 4-edge-chromatic (Petersen {1898]). 
The Petersen graph is 3-regular; 3-edge-colorability requires a 1-factorization. 
Deleting a perfect matching leaves a 2-factor; all components are cycles. The 
1-factorization can be completed only if these are all even cycles. 

Thus it suffices to show that every 2-factor is isomorphic to 2C;. Con- 
sider the drawing consisting of two 5-cycles and a matching (the cross edges) 
between them. We consider cases by the number of cross edges used. 
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Every cycle uses an even number of cross edges, so a 2-factor H has an 
even number m of cross edges. If m = 0 (left figure), then H = 2Cs. 

If m = 2 (central figure), then the two cross edges have nonadjacent end- 
points on the inner cycle or the outer cycle. On the cycle where their endpoints 
are nonadjacent, the remaining three vertices force all five edges of that cycle 
into H, which violates the 2-factor requirement. 

If m = 4 (right figure), then the cycle edges forced into H by the unused 
cross edges form a 2P; whose only completion to a 2-factor in H is 2C5. 

Note that since C; is 3-edge-colorable, the graph is 4-edge-colorable. | 


Now we consider all simple graphs. We make A(G) + 1 colors available 
and build a proper edge-coloring, incorporating edges one by one until we have 
a proper A(G) + 1-edge-coloring of G. The algorithm runs surprisingly quickly. 


7.1.10. Theorem. (Vizing [1964, 1965], Gupta [1966]) If G is a simple graph, 
then x’(G) < A(G) +1. 


Proof: Let f be a proper A(G) + 1l-edge-coloring of a subgraph G’ of G. If 
G’ # G, then some edge uv is uncolored by f. After possibly recoloring some 
edges, we extend the coloring to include uv; call this an augmentation. After 
e(G) augmentations, we obtain a proper A(G) + 1-edge-coloring of G. 

Since the number of colors exceeds A(G), every vertex has some color not 
appearing on its incident edges. Let ag be a color missing at u. We generate a 
list of neighbors of u and a corresponding list of colors. Begin with vp = v. 

Let a; be a color missing at vo. We may assume that a; appears at uv on 
some edge uv1; otherwise, we would use a; on up. 

Let aa be a color missing at v}. We may assume that ag appears at uv on 
some edge uv2; otherwise, we would replace color a; with ag on uv; and then 
use da) ON vvo to augment the coloring. 

Having selected: uv;_1; with color a;_1, let a; be a color missing at v;_1. If a; 
is missing at uv, then we use a; on uv;_; and shift color a; from uv; to uv;_, for 
1 < j <i -—1 to complete the augmentation. We call this downshifting from i. 
If a; appears at u (on some edge uv;), then the process continues. 


a3? \, a2 u 
v2 

NX \ 

\ 

— \ 

ay é 
» 
v1 vo =U 


Since we have only A(G) +1 colors to choose from, the list of selected colors 
eventually repeats (or we complete the augmentation by downshifting). Let / 
be the smallest index such that a color missing at v; is in the list aj, ..., a); let 
this color be a,. Instead of extending the list, we use this repetition to perform 
the augmentation in one of several ways. 

The color a, missing at v; is also missing at y,_; and appears on uy. If 
ao does not appear at v;, then we downshift from v; and use color ap on uv; to 
complete the augmentation. Hence we may assume that dp appears at v. 
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Let P be the maximal alternating path of edges colored ag and a, that 
begins at v, along color aj. There is only ore such path, because each vertex 
has at most one incident edge in each color (we ignore edges not yet colored). 
To complete the augmentation, we will interchange colors ag and a, on P and 
downshift from an appropriate neighbor of u, depending on where P goes. 

If P reaches y, then it arrives at y4, along an edge with color ao, follows v,.u 
in color c,, and stops at u, which lacks color ap. In this case, we downshift from 
v, and switch colors on P (left picture below). 

If P reaches 41, then it reaches v,_; on color do and stops there, because 
a, does not appear at 1,_1. In this case, we downshift from v,_1, give color ag to 
uv,_1, and switch colors on P (middle picture). 

If P does not reach y% or v,_1, then it ends at some vertex outside 
{u, vj, Vg, ye-1}. In this case, we downshift from v,;, give color ap to uv;, and 
switch colors on P (rightmost picture). 

In each case, the changes described yield a proper A(G) + 1-edge- ee 
of G’ + uv, so we have completed the desired augmentation. 


v1 Vv 


P reaches v; P reaches 14_1 otherwise 


For simple graphs, we now have only two possibilities for x’. 


7.1.11. Definition. A simple graph G is Class 1 if x’(G) = A(G). It is Class 2 
if x’/(G) = A(G) +1. 


Determining whether a graph is Class 1 or Class 2 is generally hard (Holyer 
[1981]; see Appendix B). Thus we seek conditions that forbid or guarantee A(G)- 
edge-colorability. Examples of such conditions include Exercises 24-27. 


7.1.12.* Remark. There is an obvious necessary condition for a graph to be 
Class 1 that is conjectured to be sufficient when A(G) > $n(G). Part (a) of 
Exercise 27 observes that a subgraph of G with odd order is an obstruction 
to A(G)-edge-colorability if it has too many edges. A subgraph H of a simple 
graph G is an overfull subgraph if n(#) is odd and 2e(H)/(n(H) — 1) > A(G). 

The Overfull Conjecture (Chetwynd—Hilton [1986]—see also Hilton 
[1989]) states that if A(G) > n(G)/3, then a simple graph G is Class 1 if and 
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only if G has no overfull subgraph. The Petersen graph with a vertex deleted 
shows that the condition is not sufficient when A(G) = n(G)/3 (Exercise 28). 

The Overfull Conjecture implies the 1-factorization Conjecture: Ifr > 
m (or r > m—1 ifm is even), then every r-regular simple graph of order 2m is 
Class 1. This also is sharp (Exercise 29). 

The conclusions of the two conjectures hold when A(G) is large enough 
(Chetwynd-—Hilton [1989], Niessen—Volkmann [1990], Perkovic—Reed [1997], 
Plantholt [2001]. | 


When G has multiple edges, y’(G) < |[8A(G)/2] (Shannon [1949]) and 
x'(G) < A(G) + “(G) (Vizing [1964, 1965], Gupta [1966]) These bounds follow 
(Exercise 35) from that of Andersen [1977] and Goldberg [1977, 1984]: 


x'(G) < max{A(G), maxp | (d(x) + (xy) + u(yz) + d(z)) |} 


where P = {x, y,z € V(G): y € N(x) NN(z)}. Proving this bound uses the meth- 
ods of Theorem 7.1.10 plus counting arguments. To illustrate the use of count- 
ing arguments, we prove Shannon’s Theorem from that of Vizing and Gupta. 


7.1.13.* Theorem. (Shannon [1949]) If G is a graph, then x’(G) < 2A(G). 
Proof: Let k = x’(G), and assume k = (3/2)A(G). Let G’ be a minimal sub- 
graph of G with x’(G’) = k. Since k < A(G’) + u(G’) (Vizing—Gupta), we obtain 
L(G’) > A(G)/2. Let e with endpoints x, y be an edge with multiplicity u(G’). 

Let f be a proper k — 1-edge-coloring of G’ — e. In G’ — e, both x and y 
have degree at most A(G) — 1, so under f at least (k — 1) — (A(G) — 1) colors 
are missing at x, and similarly at y. No color is missing at both, since G’ is not 
k — 1-edge-colorable. Accounting for the (G’) — 1 colors used on edges with 
endpoints x, y yields 


2(k — A(G)) + (A(G)/2) — 1 < 2(k — A(G)) + (G’) -1 <k—-1, 
and hence k < (3/2)A(G). | 


Finally, there is a general conjecture analogous to the Overfull Conjecture. 


7.1.14.* Conjecture. (Goldberg [1973, 1984], Seymour [1979a]) 


If x'(G) => A(G) + 2, then x’(G) = maxycg | ite |: . 


CHARACTERIZATION OF LINE GRAPHS (optional) 


Characterizations of line graphs can lead to good algorithms to test whether 
a graph G is a line graph and, if so, to obtain H such that L(H) = G. 


7.1.15. Example. To illustrate the ideas, we prove that the rightmost graph 
below is not the line graph of a simple graph. The kite G (two triangles with 
a common edge) is the line graph of the paw H (a claw plus an edge). By case 
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analysis, we find that H is the only simple graph whose line graph is G, and 
the edges becoming the vertices of degree 2 in G must be the dashed edges. 
The rightmost graph adds a vertex to G having only the vertices of degree 
2 as neighbors. The result is not a line graph, because there is no way to add 
an edge to H that shares an endpoint with each dashed edge without sharing 
an endpoint with a solid edge. | 


G=L(A) H not a line graph 


Our first characterization encodes the process of taking the line graph. If 
G = L(A) and H issimple, then each v € V(AH) with d(v) > 2 generates a clique 
Q(v) in G corresponding to edges incident to v. These cliques partition E(G). 
Furthermore, each vertex e € V(G) wae only to the cliques generated by the 
two endpoints of e € E(fA). 

For example, when G is the kite, we can partition E(G) into three cliques 
(a triangle plus two edges), each vertex covered at most twice. These three 
cliques correspond to the vertices of degree at least 2 in the paw. The rightmost 
graph above does not have such a partition. 


7.1.16. Theorem. (Krausz [1943]) Fora simple graph G, there is a solution to 
L(H) = Gif and only if G decomposes into complete subgraphs, with each 
vertex of G appearing in at most two in the list. 


Proof: We argued above that the condition is necessary. Note that when G = 
L(A), the vertices of G that belong to only one of the cliques we have defined 
are those corresponding to edges of H that are incident to leaves. 

For sufficiency, let S;,..., 5, be the vertex sets of the specified complete 
subgraphs. We construct H such that G = L(A). Isolated vertices of G become 
isolated edges of H, so we may assume that 5(G) > 1. Let v1,..., v; be the 
vertices of G (if anv) that appear in exactly one of S;,..., Syn. Give H one 
vertex for each set in the list A = Sj, ..., Sy, {v1}, ..., {v;}}, and let vertices of H 
be adjacent if the corresponding sets intersect. 

Each vertex of G appears in exactly two sets in A, and no two vertices 
appear in the same two sets. Hence H is a simple graph with one edge for each 
vertex of G. If vertices are adjacent in G, then they appear together in some 5§;, 
and the corresponding edges of H share the vertex for §;, HenceG = L(H). @ 


Krausz’s characterization does not directly yield an efficient test for line 
graphs, because there are too many possible decompositions to test. The next 
characterization tests substructures of fixed size and therefore yields a good 
algorithm. We say that each triangle T in G is odd or even as defined below. 
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T is odd if |N(v) M V(T)| is odd for some v € V(G). 

T is even if |N(v) M V(T)| is even for every v € V(G). 
An induced kite is a double triangle; it consists of two triangles sharing an 
edge, and the two vertices not in that edge are nonadjacent. 


7.1.17. Theorem. (van Rooij and Wilf [1965]) For a simple graph G, there is a 
solution to L(H) = Gif and only if G is claw-free and no double triangle of 
G has two odd triangles. 


Proof: Necessity. Suppose that G = L(H). A vertex e in G with neighbors 
x, y, Z corresponds to an edge e in H incident to edges x, y, z. Since e has only 
two endpoints in H, two of x, y,z are incident at one of them and hence are 
adjacent in G. This forbids the claw as an induced subgraph of G. 

For the other condition, we saw in Example 7.1.15 that the vertices of a 
double triangle in G must correspond to the edges of a paw in H. In particular, 
the vertices of one of these triangles in G correspond to the edges of a triangle 
in H. This triangle must be even, because every edge in H incident to exactly 
one vertex of a triangle shares an endpoint with exactly two of its edges. Hence 
for each double triangle in G, at least one of its triangles is even. 

Sufficiency. Suppose that G satisfies the specified conditions. We may 
assume that G is connected; otherwise; we apply the construction to each com- 
ponent. The case where G is claw-free and has a double triangle with both 
triangles even is very special; there are only three such graphs (Exercise 38). 
Here we consider only the general case, in which every double triangle of G has 
exactly one odd triangle. 

By Theorem 7.1.16, it suffices to a G into complete subgraphs, us- 
ing each vertex in at most two of them. Let S;,..., 5; be the maximal complete 
subgraphs of G that are not even triangles, and let T,,..., T; be the edges that 
belong to one even triangle and no odd triangle. We claim that together these 
form the desired decomposition B. 


T 


T 


Every edge appears in a maximal complete subgraph, but every triangle 
in a complete subgraph with more than three vertices is odd. Hence each edge 
T; in the list is not in any S;. Also S; and S; share no edge, because G has 
no double triangles with both triangles odd. Hence the subgraphs in B are 
pairwise edge-disjoint. 

If e € E(G), then e is in some S; unless the only maximal clique containing 
e is an even triangle. In this case e is a T;, since we have forbidden double 
triangles with both triangles even. Hence B is a decomposition. 

It remains to show that each v € G appears in at most two of these sub- 
graphs. Suppose that v belongs to A, B, C € B. Edge-disjointness implies that 
v has neighbors x, y, z with each belonging to only one of {A, B, C}. Since G has 
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no induced claw, we may assume that x ~ y. By edge-disjointness, the trian- 
gle vxy cannot belong to a member of B. Hence it must be an even triangle. 
Therefore, z must have exactly one other edge to vxy, sayz <> x andz # y. But 
now the same argument shows zvx is an even triangle, and we have a double 
triangle with both triangles even. | 


Theorem 7.1.17 is close to a forbidden subgraph characterization. 


7.1.18. Theorem. (Beineke [1968]) A simple graph G is the line graph of some 
simple graph if and only if G does not have any of the nine graphs below 
as an induced subgraph. 


A Peo 


Proof: By Theorem 7.1.17, it suffices ti show that the eight graphs listed other 
than K;3 are the vertex-minimal claw-free graphs containing a double triangle 
with both triangles odd. Each such graph has a double triangle and one or 
two additional vertices that make the triangles odd by having one or three 
neighbors in the triangles. The details of showing that this is the full list are 
requested in Exercise 40. a 


ll 
ad 
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The characterizations in Theorems 7.1.17—7.1.18 yield algorithms to test 
whether G is a line graph that run in time polynomial in n(G). In fact, there is 
such an algorithm that runs in linear time (Lehot [1974]) and produces a graph 
H such that G = L(H) when G is a line graph. This graph H is unique if G has 
no component that is a triangle (Exercise 39). 


EXERCISES 


7.1.1. (—) For each graph G below, compute x/(G) and draw L(G). 


7.1.2. (—) Give an explicit edge-coloring to prove that x’(Q,) = A(Qx) 
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7.1.3. (—) Determine the edge-chromatic number of C,, 0 Ko. 
7.1.4. (—) Obtain an inequality for x’(G) in terms of e(G) and a’(G). 
7.1.5. (—) Prove that the Petersen graph is the complement of L(Ks). 
7.1.6. (—) Determine the number of triangles in the line graph of the Petersen graph. 
7.1.7. (—) Determine whether P; is a line graph. If so, find H such that L(H) = Ps. 
7.1.8. (—) Prove that L(Kinn) = Km OKn. 
® e e e ® 
7.1.9. Let G be a simple graph. Prove that vertices form a clique in L(G) if and only if 


the corresponding edges in G have one common endpoint or form a triangle. (Comment: 
Thus w(L(G)) = A(G) unless A(G) = 2 and some component of G is a triangle.) 


7.1.10. Let G be a simple graph without isolated vertices. Prove that if L(G) is con- 
nected and regular, then either G is regular or G is a bipartite graph in which vertices 
of the same partite set have the same degree. (Ray-Chaudhuri [1967]) 


7.1.11. (!) Let G be a simple graph. 
a) Prove that the number of edges in L(G) is S evic) (70) 
b) Prove that G is isomorphic to L(G) if and only if G is 2-regular. 


7.1.12. Let G be a connected simple graph. Use part (a) of Exercise 7.1.11 to determine 
when e(L(G)) < e(G). = 


50) 


At 


/m 


7.1.13. (+) Prove that the graph below isthe only simple graph whose line graph is 
isomorphic to its complement. (Albertson) = 


7.1.14. (!) Let G be a k-edge-connected simple graph. Prove that L(G) is k-connected 
and is 2k — 2-edge-connected. (Hint: For a minimum edge cut [S, S] in L(G), describe 
what the cut corresponds to in G and count its edges in terms of the vertices of G.) 


7.1.15. (!) Use Tutte’s 1-factor Theorem to prove that every connected line graph of even 
order has a perfect matching. Conclude from this that the edges of a simple connected 
graph of even size can be partitioned into paths of length 2. (Comment: Exercise 3.3.22 
shows that every connected claw-free graph has a perfect matching , but that stronger 
result is more difficult than this.) (Chartrand—Polimeni—Stewart [1973]) 


7.1.16. (*) Let G be a simple graph. Prove that y(L(G)) > y(G), where y(G) denotes 
the genus of G (Definition 6.3.20). (D. Greenwell) 


7.1.17. Compute the number of proper 6-edge-colorings of the graph below. 


7.1.18. (!) Give an explicit edge-coloring to prove that x’(K,,;) = A(K;;). 


7.1.19. (!) Prove that for every simple bipartite graph G, there is a A(G)-regular simple 
bipartite graph H that contains G. 
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7.1.20. (!) Let D be a digraph (loops allowed) such that d+(v) < d and d-(v) < d for 
all v € V(D). Prove that E(D) can be colored using at most d colors so that the edges 
entering each vertex have distinct colors and the edges exiting each vertex have distinct 
colors. (Hint: Transform the digraph into another object where a known result applies.) 


7.1.21. Algorithmic proof of Theorem 7.1.7. Let G be a bipartite graph with maximum 
degree k. Let f be a proper k-edge-coloring of a subgraph H of G. Let uv be an edge not 
in H; By using a path alternating in two colors, show that f can be altered and then 
extended to a proper k-edge-coloring of H + uv. Conclude that x'/(G) = A(G). 


7.1.22. Use Brooks’ Theorem to an appropriate graph to prove that if G is a simple 
graph with A(G) = 3, then G is 4-edge-colorable. (Comment: The result is a special 
case of Vizing’s Theorem; do not use Vizing’s Theorem to prove this.) 


7.1.23. (+) Let K(p, q) be the complete p-partite graph with q vertices in each partite 
set. Let G[H] denote the composition operation, in which each vertex of G expands into 
a copy of H. Note that K(p, q) = K(p, d)(K gal when d divides q. 

a) Show that if G has a decomposition into copies of F, then G[K,,] has a decom- 
position into copies of F[K,,]. Show also that the relation “G decomposes into spanning 
copies of F” is transitive. 

b) Cliques of even order decompose into 1-factors. Cliques of odd order decom- 
pose into spanning cycles. Use these statements and part (a) to.prove that K(p, q) 
decomposes into 1-factors when pq is even. (Hartman [1997]) 


7.1.24. (!) Let G and H be nontrivial simple graphs. Use Vizing’s Theorem to prove that 
x'(A) = A(#A) implies x/(G 0H) = A(GO#). 


7.1.25. Kotzig’s Theorem for cartesian products of simple graphs. 

a) Use Vizing’s Theorem to prove that_x'/(GO Kz) = A(GO Kg). 

b) Let Gi, G2 be edge-disjoint graphs with vertex set V, and let Hi, Hz be edge- 
disjoint graphs with vertex set W. Prove that (Gi; UG2)O(Ai UA2) = (G10A2)U (G20). 

c) Use parts (a) and (b) to prove that ‘(GO H) = A(GOA) if both G and H have 1- 
factors. (Comment: As a result, the product of the Petersen graph with itself is Class 1, 
which does not follow from Exercise 7.1.24. Here neither factor need be Class 1; there 
G need not have a 1-factor.) (Kotzig [1979], J. George [1991]) 


7.1.26. (!) Let G be a regular graph with a cut-vertex. Prove that x'(G) > AC). 


7.1.27. Density conditions for x'(G) > A(G). 

a) Prove that if n(G) = 2m + 1 and e(G) > m- A(G), then x’(G) > A(G). 

b) Prove that if G is obtained from a k-regular graph with 2m +1 vertices by deleting 
fewer than k/2 edges, then x’(G) > A(G). 

c) Prove that if G is obtained by subdividing an edge of a regular graph with 2m 
vertices and degree at least 2, then x’(G) > A(G). 


7.1.28. (*—) Prove that the Petersen graph has no overfull subgraph. 


7.1.29. Let G be the m — 1-regular connected graph formed from 2K,, by deleting an 
edge from each component and adding two edges between the components to restore 
regularity. Prove that G is not 1-factorable if m is odd and greater than 3. (Comment: 
This shows that the 1-factorization Conjecture (Remark 7.1.12) is sharp.) 
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7.1.30. («!) Overfull Conjecture = 1-factorization Conjecture (Remark 7.1.12). 

a) Prove that in a regular graph of even order, an induced subgraph is overfull if 
and only if the subgraph induced by the other vertices is overfull. 

b) Let G be an k-regular graph of order 2m having an overfull subgraph. Prove that 
k < mif m is odd and that k < m — 1 if m is even. 


7.1.31. Given an edge-coloring of a graph G, let c(v) denote the number of distinct colors 
appearing on edges incident to v. Among all k-edge-colorings of G, a coloring is optimal 
if it maximizes Deis c(v). 

a) Prove that if no component is an odd cycle, then G has a 2-edge-coloring where 
both colors appear at each vertex of degree at least 2. (Hint: Use Eulerian circuits.) 

b) Let f be an optimal k-edge-coloring of G in which color a appears at least twice 
at u € V(G) and color b does not appear at u. Let H be the subgraph of G consisting of 
edges colored a or b. Prove that the component of H containing u is an odd cycle. 

c) Let G be a bipartite graph. Conclude from part (b) that G is A(G)-edge-colorable. 
(Comment: These ideas also lead to a proof of Vizing’s Theorem.) (Fournier [1973]) 


7.1.32. Let G be a bipartite graph with minimum degree k. Prove that G has a k-edge- 
coloring in which at each vertex v, each color appears [d(v)/k] or |d(v)/k] times. (Hint: 
Use a graph transformation.) (Gupta [1966]) 


7.1.33. Use Vizing’s Theorem to prove that every simple graph with maximum degree 
A has an} “equitable” A + 1-edge-coloring: a proper edge-coloring with each color used 
[e(G)/(A + 1)] or Le(G)/(A + 1)] times. (dé Werra [1971], McDiarmid [1972]) 


7.1.34. Use Petersen’s Theorem (every 2k-regular graph has a 2-factor—Theorem 3.3.9) 
to prove that x'(G) < 3 [A(G)/2] when G is a loopless graph. 


7.1.35. Bounds on x'(G). Let P = {x,y,z € V(G): y € N(x) N N(z). Prove that the last 
bound below (Andersen [1977], Goldberg [1977, 1984]) implies the earlier bounds. 
x'(G) < [8A(G)/2]. (Shannon [1949])” 
x'(G) < A(G) + u(G). (Vizing [1964, 1965], Gupta [1966]) 
x'(G) < max{A(G), maxp | (d(x) + d(y) + d(z)) |}. (Ore [1967al) 


x'(G) < max{A(G), maxp | 3 (d(x) + u(xy) + w(yz) + d(z)) |}. 


7.1.36. (+) Forn # 8, prove that L(K,,) is the only 2n — 4-regular simple graph of order 
(5) in which nonadjacent vertices have four common neighbors and adjacent vertices 
have n — 2 common neighbors. (Comment: When n = 8, three exceptional graphs satisfy 


the conditions.) (Chang [1959], Hoffman [1960]) 


7.1.37. (+) Forn, mnot both equalling 4, prove that L(K,,,,) is the only (n+m—2)-regular 
simple graph of order mn in which nonadjacent vertices have two common neighbors, 
n(3) pairs of adjacent vertices have m — 2 common neighbors, and m(5) pairs of adja- 
cent vertices have n — 2 common neighbors. (Comment: When n = m = 4, there one 


exceptional graph—Shrikande [1959].) (Moon [1963], Hoffman [1964]) 


7.1.38. (*) Let G be a connected, simple, claw-free graph having a double triangle H 
' with each triangle even. Prove that G is one of the three graphs below, and conclude 
that G is a line graph. (Comment: This completes the proof of Theorem 7.1.17.) 
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7.1.39. (x) A Krausz decomposition of a simple graph H is a partition of E(H) into 
cliques such that each vertex of H appears in at most two of the cliques. 

a) Prove that for a connected simple graph H, two Krausz decompositions of H that 
have a common clique are identical. 

b) Find distinct Krausz decompositions for the graphs in Exercise 7.1.38. 

c) Prove that no other connected simple graph except Ks; has two distinct Krausz 
decompositions (use Exercise 7.1.38 and the proof of Theorem 7.1.17). 

d) Conclude that K1,3, K3 is the only pair of nonisomorphic connected simple graphs 
with isomorphic line graphs. (Whitney [1932a]) 


7.1.40. (x) Complete the proof of Theorem 7.1.18 by proving that a simple graph with 
no induced claw has a double triangle with both triangles odd if and only if it contains 
an induced subgraph among the other eight graphs listed in the theorem statement. 


7.2. Hamiltonian Cycles 


Studied first by Kirkman [1856], Hamiltonian cycles are named for Sir 
William Hamilton, who described a game on the graph of the dodecahedron in 
which one player specifies a 5-vertex path and the other must extend it to a 
spanning cycle. The game was marketed as the “Traveller’s Dodecahedron”, a 
wooden version in which the vertices were named for 20 important cities. 
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7.2.1. Definition. A Hamiltonian graph is a graph with a spanning cycle, 
also called a Hamiltonian cycle. 


Until the 1970s, interest in Hamiltonian cycles centered on their relation- 
ship to the Four Color Problem (Section 7.3). Later study was stimulated’ by 
practical applications and by the issue of complexity (Appendix B). 

No easily testable characterization is known for Hamiltonian graphs; we 
will study necessary conditions and sufficient conditions. Loops and multiple 
edges are irrelevant; a graph is Hamiltonian if and only if the simple graph 
obtained by keeping one copy of each non-loop edge is Hamiltonian. Therefore, 
in this section we restrict our attention to simple graphs; this is relevant 
when discussing conditions involving yertex degrees. 

For further material on Hamiltonian cycles, see Chvatal [1985a]. 
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NECESSARY CONDITIONS 


Every Hamiltonian graph is 2-connected, because deleting a vertex leaves 
a subgraph with a spanning path. Bipartite graphs suggest a way to strengthen 
this necessary condition. 


7.2.2. Example. Bipartite graphs. A spanning cycle in a bipartite graph visits 
the two partite sets alternately, so there can be no such cycle unless the partite 
sets have the same size. Hence K,,,, is Hamiltonian only ifm = n. Alternatively, 
we can argue that the cycle returns to different vertices of one partite set after 
each visit to the other partite set. | 


7.2.3. Proposition. If G has a Hamiltonian cycle, then for each nonempty set 
S C V, the graph G — S has at most |S| components. 


Proof: When leaving a component of G — S, a Hamiltonian cycle can go only to 
S, and the arrivals in S must use distinct vertices of 5S. Hence S must have at 
least as many vertices as G — S has components. a 


7.2.4. Definition. Let c(H) denote the number of components of a graph H. 


Thus the necessary condition is that c(G — S) < |S| for all @ #5 CV. This 
condition guarantees that G is 2-connected (deleting one vertex leaves at most 
one component), but it does not guarantee a Hamiltonian cycle. 


7.2.5. Example. The graph on the left below is bipartite with partite sets of 
equal size. However, it fails the necessary condition of Proposition 7.2.3. Hence 
it is not Hamiltonian. 


The graph on the right shows that the necessary condition is not sufficient. 
This graph satisfies the condition but has no spanning cycle. All edges incident 
to vertices of degree 2 must be used, but in this graph that requires three edges 
incident to the central vertex. 
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The Petersen graph is another non-Hamiltonian graph satisfying the con- 
dition. We proved in Example 7.1.9 that 2C; is the only 2-factor of the Petersen 
graph, so it has no spanning cycle. | 


7.2.6.* Remark. Strengthening a necessary condition may yield a sufficient 
condition. Perhaps requiring |S| > 2c(G—S) for every cutset 5 would guarantee 
a spanning cycle. A graph G is t-tough if |S| > tc(G — S) for every cutset § C V. 
The toughness of G is the maximum t such that G is t-tough. For example, 
the toughness of the Petersen graph is 4/3 (Exercise 23). 

By Proposition 7.2.3, spanning cycles require toughness at least 1. Chvatal 
[1974] conjectured that a sufficiently large toughness is sufficient. No value of 
toughness larger than 1 is necessary, since C,, itself is only 1-tough. For some 
years it was thought that toughness 2 would be sufficient. Enomoto—Jackson— 
Katerinis—Saito [1985] constructed non-Hamiltonian graphs with toughness 
2 —e for each e > 0. Finally, Bauer-Broersma—Veldman [2000] constructed 
non-Hamiltonian graphs with toughness approaching 9/4. Chvatal’s conjec- 
ture that some value of toughness suffices remains open. | 


SUFFICIENT CONDITIONS 


books 


The number of edges needed to force an n-vertex graph to be Hamiltonian 
is quite large (Exercises 26-27). Under conditions that “spread out” the edges, 
we can reduce the number of edges while still guaranteeing Hamiltonian cycles. 
The simplest such condition is a lower bound on the minimum degree; 5(G) > 
n(G)/2 suffices. We first note that no smaller minimum degree is sufficient. 


7.2.7. Example. The graph consisting of cliques of orders [(n + 1)/2] and 
[(n + 1)/2] sharing a vertex has minimum degree |(n — 1)/2]| but is not Hamil- 
tonian (not even 2-connected). 

For odd order, another non-Hamiltonian graph with this minimum degree 
is the biclique with partite sets of sizes (n — 1)/2 and (n + 1)/2. 

Proving that 6(G) > n(G)/2 forces a spanning cycle thus shows that 
L(n — 1)/2] is the largest value of the minimum degree among non-Hamiltonian 
graphs with n vertices. | 


7.2.8. Theorem. (Dirac [1952b]). If G is a simple graph with at least three 
vertices and 6(G) > n(G)/2, then G is Hamiltonian. 


Proof: The condition n(G) > 3 is annoying but must be included, since Kg is 
not Hamiltonian but satisfies 6(K2) = n(Ke)/2. 
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The proof uses contradiction and extremality. If there is a non-Hamiltonian 
graph satisfying the hypotheses, then adding edges cannot reduce the mini- 
mum degree. Thus we may restrict our attention to maximal non-Hamiltonian 
graphs with minimum degree at least n/2, where “maximal” means that adding 
any edge joining nonadjacent vertices creates a spanning cycle. 

When u # v in G, the maximality of G implies that G has a spanning path 
V1,..., U, from u = v, to v = v,, because every spanning cycle in G+ uv contains 
the new edge uv. To prove the theorem, it suffices to make a small change in 
this cycle to avoid using the edge uv; this will build a spanning cycle in G. 

If a neighbor of u directly follows a neighbor of v on the path, such as u @ 
vi41 and v © y;, then (u, vj41, Vj42,...,U, Uj, Ui-1,..., V2) iS a spanning cycle. 


u Vi Vid v 
To prove that such a cycle exists, we show that there is a common index in 


the sets § and T defined by S = {i: u © vj41} and T = {i: v © v;}. Summing 
the sizes of these sets yields 


ISUT| + |SOT| = |S] + |T| =d(u) + d(v) 2 n. 


Neither S nor T contains the index n. Thus |S U T| < n, and hence |SNT| > 1. 
We have established a contradiction by finding a spanning cycle in G; hence 
there is no (maximal) non-Hamiltonian graph satisfying the hypotheses. a 


P 


me/ 


Ore observed that this argument uses 6(G) > n(G)/2 only to show that 
d(u)+d(v) > n. Therefore, we can weaken the requirement of minimum degree 
n/2 to require only that d(u) + d(v) > n whenever u # v. We also did not 
need that G was a maximal non-Hamiltonian graph, only that G + uv was 


Hamiltonian and thereby provided a spanning u, v-path. 


7.2.9. Lemma. (Ore [1960]) Let G be a simple graph. If u,v are distinct non- 
adjacent vertices of G with d(u) + d(v) > n(G), then G is Hamiltonian if 
and only if G + uv is Hamiltonian. 


Proof: One direction is trivial, and the proof of the other direction is the same 
as for Theorem 7.2.8. | 


Bondy and Chvatal [1976] phrased the essence of Ore’s argument in a 
much more general form that yields sufficient conditions for cycles of length 
1 and other subgraphs. Here we discuss only the application to spanning cy- 
cles.. Using Lemma 7.2.9 to add edges, we can test whether G is Hamiltonian 
by testing whether the larger graph is Hamiltonian. 


7.2.10. Definition. The (Hamiltonian) closure of a graph G, denoted C(G), 
is the graph with vertex set V(G) obtained from G by iteratively adding 
edges joining pairs of nonadjacent vertices whose degree sum is at least n, 
until no such pair remains. 
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CO - S-s-e 


The graph above begins with vertices of degree 2, but its closure is Kg. 
Ore’s Lemma yields the following theorem. 


7.2.11. Theorem. (Bondy—Chvatal [1976]) A simple n-vertex graph is Hamil- 
tonian if and only if its closure is Hamiltonian. | 


Fortunately, the closure does not depend on the order in which we choose 
to add edges when more than one is available. 


7.2.12. Lemma. The closure of G is well-defined. 


Proof: Let ¢;,...,¢, and f;,..., f, be sequences of edges added in forming 
C(G), the first yielding G,; and the second Gp». If in either sequence nonadja- 
cent vertices w and v acquire degree summing to at least n(G), then the edge 
uv must be added before the sequence ends. 

Thus fi, being initially addable to G, must belong to G,. Similarly, if 
fi,..., fi-1 € E(G1), then f, becomes addable to G; and therefore belongs to 
G,. Hence neither sequence contains a first edge omitted by the other sequence, 
and we have G, C G2 and G2 C Gj. a 


telegrar 


We now have a necessary and sufficient condition to test for Hamiltonian 
cycles in simple graphs. It doesn’t help much, because it requires us to test 
whether another graph is Hamiltonian! Nevertheless, it does furnish a method 
for proving sufficient conditions. A condition that forces C(G) to be Hamiltonian 
also forces a Hamiltonian cycle in G. 

For example, the condition may imply C(G) = K,. Chvatal used this 
method to prove the best possible degree sequence condition for Hamiltonian 
cycles. Some vertex degrees can be small if others are large enough. 


7.2.13. Theorem. (Chvatal [1972]) Let G be a simple graph with vertex de- 
grees d, < --- < d,,wheren > 3. Ifi < n/2implies that d; > iord,_; >n-i 
(Chvatal’s condition), then G is Hamiltonian. 


Proof: Adding edges to form’ the closure reduces no entry in the degree se- 
quence. Also, G is Hamiltonian if and only if C(G) is Hamiltonian. Thus it 
suffices to consider the case where C(G) = G, which we describe by saying that 
G is closed. In this case, we prove that Chvatal’s condition implies that G = K,,. 

We prove the contrapositive; if G is a closed n-vertex graph that is not a 
complete graph, then we construct a value of i less than n/2 for which ChvAtal’s 
condition is violated. Violation means that at least i vertices have degree at 
most i and at least n — i vertices have degree less than n — i. 
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With G + K,, we choose among the pairs of nonadjacent vertices a pair 
u, v with maximum degree sum. Because G is closed, u # v implies that d(u) + 
d(v) <n. We choose the labels on u, v so that d(u) < d(v). Since d(u) +d(v) <n, 
we thus have d(u) < n/2. Let i =d(u). 

We need to find i vertices with degree at most i. Because we chose a non- 
adjacent pair with maximum degree sum, every vertex of V — {v} that is not 
adjacent to v has degree at most d(u), which equals i. There are n — 1 — d(v) 
such vertices, and d(u) + d(v) <n —1 yieldsn —1-—d(v) >i. 

We also need n — i vertices with degree less than n —i. Every vertex of 
V — {u} that is not adjacent to u has degree at most d(v), and we have d(v) < 
n—d(u) =n-—i. There are n — 1 —d(u) such vertices. Since d(u) < d(v), we can 
also add u itself to the set of vertices with degree at most d(v). We thus obtain 
n —i vertices with degree less than n — i. 

We have proved that d; < i and d,_; < n —i for this specially chosen i, 


which contradicts the hypothesis. | 


CC] wo D> 


so) 


7.2.14, Example. Non-Hamiltonian graphs with “large” vertex degrees. The- 
orem 7.2.13 characterizes the degree sequences of simple graphs that force 
Hamiltonian cycles. If the degree sequence fails Chvatal’s condition at i, then 


nath 


the largest we can make the terms in d;,..., d,, is 
dj =i for j <i, 
dj=n—i-1 fori+1l<j<n-i, 
dj=n—1 for j >n—i. 


Let G be a simple graph realizing this degree sequence (if it exists). The 
i vertices of degree n — 1 are adjacent to all others (the central clique in the 
figure). This already gives i neighbors to the i vertices of degree i, so they form 
an independent set: and have no additional neighbors. With degree n —i — 1, 
each of the remaining n — 2i vertices must be adjacent to all vertices except 
itself and the independent set. Thus these vertices form a clique. The only 
possible realization is (K; + Kn-2;) V K;, shown below. 

This graph is not Hamiltonian, because deleting the i vertices of degree 
n — 1 leaves a subgraph with i + 1 components. If a simple graph H is non- 
Hamiltonian and has vertex degrees d; < --- < d), then Chvatal’s result implies 
that for some i the graph (K; + Kn-9;) V K; with vertex degrees d; < --- < d, 


satisfies d; > d; for alli. a 
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7.2.15. Definition. A Hamiltonian path is a spanning path. 


Every graph with a spanning cycle has a spanning path, but P,, shows that 
the converse is not true. We could make arguments like those above to prove 
sufficient conditions for Hamiltonian paths, but it is easier to use our previous 
work and prove the new theorem by invoking a theorem about cycles. To do 
this, we use a standard transformation. 


7.2.16. Remark. A graph G has a spanning path if and only if the graph Gv Kk, 
has a spanning cycle. | 


Remark 7.2.16 applies in several of the exercises. Here we use it to derive 
the analogue for paths of Chvatal’s condition for spanning cycles. 


7.2.17. Theorem. Let G be a simple graph with vertex degrees d; < --:- < d,. If 
i < (n+1)/2 implies (d; > i or d,,1_; > n —i), then G has a spanning path. 


Proof: Let G’ = Gv K,, let n’ =n+1, and let d,,...,d_, be the degree sequence 
of G’. Since a spanning cycle in G v K, becomes a spanning path in G when the 
extra vertex is deleted, it suffices to show that G’ satisfies Chvatal’s sufficient 
condition for Hamiltonian cycles. 

Since the new vertex is adjacent to all of V(G), we have d,, = n and d; = 
dj +1 for j <n’. Fori <n'/2=(n+ 1)/2, the hypothesis on G yields 


500k 


d}=d;+1>i+1>i or 4 nei tlen—-it len’ -i. 


This is precisely Chvatal’s sufficient condition, so G’ hasa spanning cycle, and 
deleting the extra vertex leaves a spanning path in G. | 


7.2.18.* Remark, The degree requirements can be weakened under conditions 
such as regularity or high toughness. Every regular simple graph G with vertex 
degrees at least n(G)/3 is Hamiltonian (Jackson [1980]). Only the Petersen 
graph prevents lowering the threshold to (n(G) — 1)/3 (Zhu—Liv—Yu [1985], 
partly simplified in Bondy—Kouider [1988]; see also Exercise 13). 

It may be possible to lower the degree condition further when connectivity 
is high. For example, Tutte [1971] conjectured that every 3-connected 3-regular 
bipartite graph is Hamiltonian. Horton [1982] found a counterexample with 
96 vertices, and the smallest known counterexample has 50 vertices (Georges 
[1989]), but stronger conditions of this sort may suffice. | 

Our last sufficient condition for Hamiltonian cycles involves connectivity 
and independence, not degrees. The proof yields a good algorithm that con- 
structs a Hamiltonian cycle or shows that the hypothesis is false. 


7.2.19. Theorem. (Chvatal—Erdés [1972]) If«(G) > a(G), then G has a Hamil- 
tonian cycle (unless G = Ko). 


Proof: With G # Ko, the conditions require «(G) > 1. Suppose that «(G) > 
a(G). Let k = «(G), and let C be a longest cycle in G. Since 5(G) > «(G), and 
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every graph with 5(G) > 2 has a cycle of length at least 5(G) + 1 (Proposition 
1.2.28), C has at least k + 1 vertices. 

Let H be a component of G — V(C). The cycle C has at least k vertices with 
edges to H; otherwise, deleting the vertices of C with edges to H contradicts 
K(G) =k. Let u4,..., uz, be k vertices of C with edges to H, in clockwise order. 

For i = 1,...,k, let a; be the vertex immediately following u; on C. If any 
two of these vertices are adjacent, say a; <> a;, then we construct a longer cycle 
by using a;a;, the portions of C from a; to u; and a; to u;, and a u;,u;-path 
through H (see illustration). 

If a; has a neighbor in H, then we can detour to H between u; and a; on C. 


Thus we also conclude that no a; has a neighbor in H. Hence {a,..., a} plus a 
vertex of H forms an independent set of size k + 1. This contradiction implies 
that C is a Hamiltonian cycle. | 


uy 


a) 
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7.2.20.* Remark. Most sufficient conditions for Hamiltonian cycles general- 
ize to conditions for long cycles. The circumference of a graph is the length 
of its longest cycle. A weaker form of a sufficient condition for spanning cycles 
may force a long cycle. Dirac [1952b] proved the first such result: a 2-connected 
graph with minimum degree k has circumference at least min{n, 2k}. Propo- 
sition 1.2.28 only guarantees a cycle of length at least k + 1. Most long-cycle 
results are more more difficult than the corresponding sufficient conditions for 
Hamiltonian cycles (see Lemma 8.4.36—Theorem 8.4.37). | 


CYCLES IN DIRECTED GRAPHS (optional) 


The theory of cycles in digraphs is similar to that of cycles in graphs. For 
a digraph G, let 6-(G) = mind™(v) and 5*(G) = mind*(v). The arguments 
of Chapter 1 using maximal paths guarantee paths of length k and cycles of 
length k + 1, where k = max{d~(G), 5+(G)}. 

Every complete graph is Hamiltonian, but orientations of compiete graphs 
are more complicated. The-necessary condition of 2-connectedness becomes a 
necessary condition of strong connectedness for spanning cycles in digraphs. 
For tournaments, this necessary condition is also sufficient (Exercise 45). 
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For arbitrary digraphs, we prove an analogue of Dirac’s theorem (Theo- 
rem 7.2.8). Indeed, it yields Dirdc’s theorem as a special case (Exercise 49). 
Meyniel [1973] subtantially strengthened the theorem by weakening the hy- 
pothesis (Theorem 8.4.42). 


7.2.21. Definition. A digraph is strict if it has no loops and has at most one 
copy of each ordered pair as an edge. 


7.2.22. Theorem. (Ghouila-Houri [1960]) If D is a strict digraph, and 
min{5+(D), 5~(D)} => n(D)/2, then D is Hamiltonian. 


Proof: Again we use contradiction and extremality. In an n-vertex counterex- 
ample D, let C be a longest cycle, with length /. As we have observed, / > 
max{5d+,5~} > n/2. Let P bea longest path in D — V(C), beginning at u, ending 
at w, and having length m > 0. Now! > n/2andn >1+m+1implym <n/2. 

Let S be the set of predecessors of u on C, and let T be the set of successors 
of w on C. By the maximality of P, every predecessor of u and successor of 
w lies in V(C) U V(P). Thus S and T each have size at least min{5t, 5~} — m, 
which is at least > n/2 — m and hence is positive. Thus S and T are nonempty. 

The maximality of C guarantees that the distance along C from a vertex 
u’ € S to a vertex w’ € T must exceed m+ 1. Otherwise, traveling along P 
instead of C from u’ to w’ yields a longer cycle. Hence we may assume that 
every vertex of S is followed on C by more than m vertices not in T. 


we 
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If the distance between suecessive vertices of S along C is always at most 
m + 1, then there is no legal place to put a vertex of T. Since both S and T 
are nonempty, we may thus assume there is a vertex of S followed on C by at 
least m + 1 vertices not in S. These are forbidden from 7, as is the immediate 
successor on C of all the other vertices of S. 

Thus at least |S| - 1+m+1 > n/2 vertices of C are not in 7. Together 
withthe vertices that are in 7, this yields |V(C)| > n — m, which contradicts 
1<n—m-—1. The contradiction implies that C must be aspanning cycle. 5 


EXERCISES 


7.2.1. (—) For which values of r is K,,, Hamiltonian? 
7.2.2. (—) Is the Grétzsch graph (Example 5.2.2) Hamiltonian? 
7.2.3. (—) For n > 1, prove that K,,,, has (n — 1)!n!/2 Hamiltonian cycles. 
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7.2.4. (—) Prove that G has a Hamiltonian path only if for every § C V(G), the number 
of components of G — S is at most |S| + 1. 


7.2.5. Prove that every 5-vertex path in the dodecahedron lies in a Hamiltonian cycle. 


7.2.6. (!) Let G be a Hamiltonian bipartite graph, and choose x, y € V(G). Prove that 
G — x — y has a perfect matching if and only if x and y are on opposite sides of the 
bipartition of G. Apply this to prove that deleting two unit squares from an 8 by 8 
chessboard leaves a board that can be partitioned into 1 by 2 rectangles if and only if 
the two missing squares have opposite colors. 


7.2.7. A mouse eats its way through a 3 x 3 x 3 cube of cheese by eating all the 1 x 1x 1 
subcubes. If it starts at a corner subcube and always moves on to an adjacent subcube 
(sharing a face of area 1), can it do this and eat the center subcube last? Give a method 
or prove impossible. (Ignore gravity.) 


7.2.8. (!) On a chessboard, a knight can move from one square to another that differs 
by 1 in one coordinate and by 2 in the other coordinate, as shown below. Prove that no 
4 x n chessboard has a knight’s tour: a traversal by knight’s moves that visits each 
square once and returns to the start. (Hint: Find an appropriate set of vertices in the 
corresponding graph to violate the necessary condition.) 


7.2.9. Construct an infinite family of non-Hamiltonian graphs satisfying the necessary 
condition of Proposition 7.2.3. 


7.2.10. (!) Hamiltonian vs. Eulerian. 

a) Find a 2-connected non—Eulerian graph whose line graph is Hamiltonian. 

b) Prove that L(G) is Hamiltonian if and only if G has a closed trail that contains 
at least one endpoint of each edge. (Harary and Nash-Williams [1965]) 


7.2.11. Construct a 3-regular 3-connected graph whose line graph is not Hamiltonian. 
(Hint: Replace each vertex in the Petersen graph with an appropriate graph and apply 
Exercise 7.2.10.) 


7.2.12. Determine whether the graph below is Hamiltonian. 
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7.2.13. Let G be the 3-regular graph obtained from the Petersen graph by replacing one 
vertex with a triangle, matching the vertices of the triangle to the former neighbors of 
the deleted vertex. Prove that G is not Hamiltonian. (Comment: Except for this graph 
and the Petersen graph, every 2-connected, k-regular graph with at most 3k + 3 vertices 
is Hamiltonian.) (Hilbig [1986]) 


7.2.14. A graph G is uniquely k-edge-colorable if all proper k-edge-colorings of G induce 
the same partition of the edges. Prove that every uniquely 3-edge-colorable 3-regular 
graph is Hamiltonian. (Greenwell—Kronk [1973]) 


7.2.15. Place n points around a circle. Let G, be the 4-regular graph obtained by joining 
each point to the nearest two points in each direction. If n > 5, prove that G, is the 
union of two Hamiltonian cycles. 


7.2.16. For k > 3, let G, be the graph obtained from two disjoint copies of Kx 4-2 by 
adding a matching between the two “partite sets” of size k. Determine all values of k 
such that G, is Hamiltonian. 


ith_ books 


7.2.17. (!) Prove that the cartesian product of two Hamiltonian graphs is Hamiltonian. 
Conclude that the k-dimensional cube Q, is Hamiltonian for k > 2. 


7.2.18. Prove that the cartesian product of two graphs with Hamiltonian paths fails to 
have a Hamiltonian cycle if and only if both graphs are bipartite and have odd order, in 
which case the product has a Hamiltonian path. 


7.2.19. (+) For each odd natural number k, construct a k — 1-connected k-regular simple 
bipartite graph that is not Hamiltonian. 


7.2.20. (!) The kth power of a simple graph G is the simple graph G* with vertex set 
V(G) and edge set {uv: dg(u, v) < k}. 

a) Suppose that G — x has at least three nontrivial components in each of which x 
has exactly one neighbor. Prove that G? is not Hamiltonian. (Hint: Consider the second 
graph in Example 7.2.5.) 

b) Prove that the cube of each connected graph (with at least three vertices) is 
Hamiltonian. (Hint: Reduce this to the special case of trees, and prove it for trees by 
proving the stronger result that if xy is an edge of the tree 7, then T* has a Hamiltonian 
cycle using the edge xy. Comment: Fleischner [1974] proved that the square of each 
2-connected graph is Hamiltonian.) 


7.2.21. Let n = k(2/ + 1). Construct a non-Hamiltonian complete k-partite graph with 


: sae nk-1 2 : 
n vertices and minimum degree 5 “> 5,,- (Snevily) 


7.2.22. Let G(k, t) be the class of connected k-partite graphs in which each partite set 
has size t and each subgraph induced by two partite sets is a matching of sizer. Fork > 4 
and r > 4, construct a graph in G(k, t) that is not Hamiltonian. (Hint: There is a graph 
in G(4, 4) with a 3-set whose deletion leaves four components; generalize this example. 
Comment: G(3, t) = {C3,}, and also every graph in G(k, 3) is Hamiltonian.) (Ayel [1982]) 
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7.2.23. (*) Prove that the Petersen graph has toughness 4/3. 


7.2.24. (*) Let t(G) denote the toughness of G. 

a) Prove that t(G) < «(G)/2. (Chvatal [1973]) 

b) Prove that equality holds in part (a) for claw-free graphs. (Hint: Consider a set 
S such that |S| = t(G) -c(G — S).) (Matthews—Sumner [1984]) 


7.2.25. (!) Let G be a simple graph that is not a forest and has girth at least 5. Prove 
that G is Hamiltonian. (Hint: Use Ore’s condition.) (N. Graham) 


7.2.26. (!) Prove that if G fails Chvatal’s condition, then G has at least n — 2 edges. 
Conclude from this that the maximum number of edges in a simple non-Hamiltonian 
n-vertex graph is ("3’) +1. (Ore [1961], Bondy [1972b]) 


7.2.27. Prove directly by induction on n that the maximum number of edges in a simple 
non-Hamiltonian n-vertex graph is ("3’) +1. 


7.2.28. Generalization of the edge bound. 

a) Let f(@i) = 2i? —i+ (n —i)(n —i — 1), and suppose that n > 6k. Prove that on the 
interval k <i <n/2, the maximum value of f (i) is f(x). 

b) Let G be a simple graph with minimum degree k. Use part (a) and Chvatal’s 
condition to prove that if G has at least 6k vertices and has more than (“9 *) +k? edges, 
then G is Hamiltonian. (Erdés [1962]) 


7.2.29. vy Let G be a simple graph with vertex degrees d, < --- < d,, and let d, < 

.. < d! be the vertex degrees in G. Prove:that if d; > d! for alli < n/2, then G hasa 
Hamiltonian path. Conclude that every simple graph isomorphic to its complement has 
a Hamiltonian path. (Clapham [1974]) = 


~ 
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7.2.30. Obtain Lemma 7.2.9 (sufficiency of Ore’s condition) from Theorem 7.2.13 (suffi- 
ciency of Chvatal’s condition). (Bondy [1978]) 


7.2.31. (!) Prove or disprove: If G isa simple graph with at least three vertices, and G 
has at least a(G) vertices of degree n(G) — 1, then G is Hamiltonian. 


7.2.32. (+) Suppose that n is even and G is a simple bipartite graph with partite sets 
X,Y of size n/2. Let the vertex degrees of G be d),...,d,. Let G’ be the supergraph of 
G obtained by adding edges so that G[Y] = K,;2. 

a) Prove that G is Hamiltonian if and only if G’ is Hamiltonian, and describe the 
relationship between the degree sequences of G and G’. 

b) Suppose that d > k or d,j2 > n/2 —k whenever k < n/4. Prove that G is 
Hamiltonian. (Hint: Assume that the degree sequence of G’ fails Chvatal’s condition for 
some i < n/2, and obtain a contradiction.) (Chvatal [1972]) 


7.2.33. (!) A graph is Hamiltonian-connected if for every pair of vertices u, v there 
is a Hamiltonian path from u to v. Prove that a simple graph G is Hamiltonian if 
e(G) > ee) + 2 and Hamiltonian-connected if e(G) > ee) +8. (Proving the two 
together permits a simpler proof.) (Ore [1963]) 


7.2.34. Necessary condition for Hamiltonian-connected. (Moon [1965a]) 

a) Prove that every Hamiltonian-connected graph G with at least four vertices has 
at least [3n(G)/2] edges. 

b) Prove that the bound in part (a) is best possible by showing that C,, 0 Ke is 
Hamiltonian-connected if m is-odd. 


7.2.35. (!) Sufficient condition for Hamiltonian-connected. (Ore [1963]) 
a) Prove that a simple graph G is Hamiltonian-connected if x # y implies d(x) + 
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d(y) > n(G). (Hint: Prove that appropriate graphs related to G are Hamiltonian by 
considering their closures.) 

‘b) Prove that part (a) is sharp by constructing, for each even n greater than 2, a 
simple n-vertex graph with minimum degree n/2 that is not Hamiltonian-connected. 


7.2.36. Las Vergnas’ condition for a simple n-vertex graph is the existence of a vertex 
ordering v;,..., v, such that there is no nonadjacent pair v;, v; satisfyingi < j,d(v;) <i, 
d(vj) < j,d(uj) +d(vj) <n, andi+ j >n. Las Vergnas [1971] proved that this condition 
is sufficient for the existence of a spanning cycle. 

a) Prove that Chvatal’s condition (Theorem 7.2.13) implies Las Vergnas’ condition, 
which means that Las Vergnas’ theorem strengthens Chvatal’s theorem. 

b) Prove that each of the graphs below fails Chvatal’s condition but has a complete 
graph as its Hamiltonian closure. Prove that the smaller graph satisfies Las Vergnas’ 
condition but the larger one does not. 


NN 


7.2.37. For @ # S c V(G), let 1(S) = [SN N(S)|/|S|. Let 6(G) = minz(S). Lu [1994] 
proved that if 9(G)n(G) > a(G), then G is Hamiltonian. Prove that «(G) > a(G) implies 
6(G)n(G) > a(G). (Comment: This shows that Lu’s theorem implies the Chvatal—Erdés 
Theorem and is a stronger result.) 


© ./math 


7.2.38. (!) Long paths and cycles. Let G bea connected simple graph with 6(G) =k > 2 
and n(G) > 2k. 

a) Let P be a maximal path in G (not a’ subgraph of any longer path). If n(P) < 2k, 
prove that the induced subgraph G[V(P)] has a spanning cycle (this cycle need not have 
its vertices in the same order as P). 

b) Use part (a) to prove that G has a path with at least 2k + * vertices. Give an 
example for each odd value of n to show that G need not have a cycle with more than 
k + 1 vertices. 


am. 


7.2.39. Prove that ifa simple graph G has degree sequence d; < --- < d, andd,+dz2 <n, 
then G has a path of length at least d, + dz + 1 unless G is the join of n — (d; + 1) isolated 
vertices with a graph on d; + 1 vertices or G = pKz, V K, for some p >= 3. (Ore [1967b]) 


7.2.40. (!) Dirac [1952b] proved that every 2-connected simple graph G has a cycle of 
length at least min{n(G), 26(G)}. Use this to prove that every 2k-regular graph with 
4k + 1 vertices is Hamiltonian. (Nash-Williams) 


7.2.41. Scott Smith conjectured that any two longest cycles in a k-connected graph have 
at least k common vertices. The approach below works for small k. 

a) Suppose that G is a 4-regular graph with n vertices that is the union of two cycles 
(multiple edges may arise). Let G’ be the 4-regular graph on n + 2 vertices obtained from 
G by subdividing two edges and adding a double édge between the two new vertices. 
Show that G’ is also the union of two spanning cycles ifn < 5. 

b) Use part (a) to conclude that any pair of longest cycles in a k-connected graph 
intersect in at least k points if k < 6. (Smith, Burr) 


7.2.42. (+) Let G be an Eulerian graph. Let V’ be the set of Eulerian circuits of G, 
considering a circuit and its reversal to be the same. Let G’ be the graph with vertex 
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‘set V’ such that two circuits are adjacent if and only if one arises from the other by 
reversing the edge order on a proper closed subcircuit. Prove that G’ is Hamiltonian 
if A(G) < 4. (Hint: Use induction on the number of vertices of degree 4, proving that 
there is a Hamiltonian cycle through every edge of G’. Comment: The conclusion also 
holds without restriction on A(G).) (Xia [1982], Zhang—Guo [1986]) 


7.2.43. Prove that the Eulerian circuit graph G’ of Exercise 7.2.42 is regular, and derive 
a formula for its vertex degree. Compare 5(G’) and n(G’) when n(G) = 2 to show that 
the preceding problem cannot be solved by applying general results on Hamiltonicity of 
regular graphs with specified degree. 


7.2.44. Prove that every tournament has a Hamiltonian path (a spanning directed 
path). (Hint: Use extremality). (Rédei [1934]) 


7.2.45. Let T be astrong tournament. For each u € V(7’) and eachk suchthat 3 <k <n, - 
prove that u belongs to a cycle of lengthk in T. (Hint: Use induction on k.) (Moon [1966]) 


7.2.46. Let.G be a 7-vertex tournament in which every vertex has outdegree 3. Use 
Exercise 7.2.45 to prove that G has two vertex-disjoint cycles. 


7.2.47. (+) Prove that every tournament has a Hamiltonian path that is not contained 
in a Hamiltonian cycle, except the cyclic tournament on three vertices and the tourna- 
ment 7; on five vertices drawn below. (Hint: Induction works, but some care is needed 
to prove the claim for six vertices. In all cases, find the desired configuration or G = 73.) 
(Griinbaum, in Harary [1969, p211]) 


7.2.48. (*) Prove that Theorem 7.2.22 is best possible by showing that the strictness 
condition on the digraph cannot be weakened to allow loops. In particular, construct 
for each even n an n-vertex digraph D that is not Hamiltonian even though at most one 
copy of each ordered pair is an edge and min{é~ (D), 6*+(D)} > n/2. 


7.2.49. (x) Obtain Theorem 7.2.8 (sufficiency of Dirac’s condition in graphs) from The- 
orem 7.2.22 (sufficiency of Ghouila-Houri’s condition on digraphs). (Hint: ‘Transform a 
simple graph G into a strict digraph by replacing each edge with a pair of directed edges 
in opposite directions.) 


7.3. Planarity, Colorings, and Cycles 


We return to the Four Color Problem to explore its historical relationship 
with the problems of edge-coloring and Hamiltonian cycles. We then consider 
ways in which the problem generalizes. 
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TAIT’S THEOREM 


In 1878, Tait proved a theorem relating face-coloring and edge-coloring of 
plane graphs, and he used this in an approach to the Four Color Theorem. This 
stimulated interest in edge-coloring. We first define face-coloring precisely. 


7.3.1. Definition. A proper face-coloring of a 2-edge-connected plane graph 
is an assignment of colors to its faces so that faces having a common edge 
in their boundaries have distinct colors. 


We often think of a face-ccloring as a coloring of the dual graph. For this 
reason, we restrict our attention to face-colorings of 2-edge-connected graphs. 
When a plane graph has a cut-edge, its dual has a loop. We say that graphs 
with loops do not have proper colorings. In a plane graph with a cut-edge, a 
face shares a boundary with itself and is thus uncolorable. 

Since adding edges does not make ordinary coloring easier, to prove the 
Four Color Theorem it suffices to prove that all triangulations are 4-colorable. 
Equivalently, we could show that all duals of triangulations are 4-face-colorable. 
The dual G* of a plane triangulation G is a 3-regular, 2-edge-connected plane 
graph (Exercise 6.1.11). Tait showed that for such graphs, proper 4-face- 
colorings are equivalent to pruper 3-edge-colorings. 


7.3.2. Theorem. (Tait [1878]) A simple 2-edge-connected 3-regular plane graph 
is 3-edge-colorable if and only if it is 4-face-colorable. 


Proof: Let G be such a graph. Suppose first that G is 4-face-colorable; we 
obtain a 3-edge-coloring. Let the four colors be denoted by binary ordered pairs: 
co = 00, c1 = 01, cp = 10, cz = 11. Color E(G) by assigning to the edge between 
faces with colors c; and c; the color obtained by adding c; and c; coordinatewise 
using addition modulo 2. (Thus cz + c3 = ci, for example.) We show that this is 
a proper 3-edge-coloring. 

Because G is 2-edge-connected, each edge bounds two distinct faces. Hence 
the color 00 never occurs as a sum. We check that the edges at a vertex receive 
distinct colors. At vertex v the faces bordering the three incident edges must 
have distinct colors {c;, c;, c.}, as illustrated below. If color 00 is not in this set, 
then the sum of any two of these is the third, and hence {c;. c;, c,} is also the 
set of colors on the edges. If c, = 00, then c; and c; appear on two of the edges, 
and the third receives color c; + c;, which is the color not in {c;, cj, cx}. 
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For the converse, suppose that G has a proper 3-edge-coloring using colors 
a, b,c (shown bold, solid, and dashed). Let E,, E,, E. be the edge sets having 
the three colors, respectively. We construct a 4-face-coloring using the four 
colors defined above. Since G is 3-regular, each color appears at every vertex, 
and the union of any two of E,, E,, E, is 2-regular, which makes it a union of 
disjoint cycles. Each face of this subgraph is a union of faces of the original 
graph. Let H; = E, UE, and H2 = E, U E,. To each face of G, assign the color 
whose ith coordinate (i € {1, 2}) is the parity of the number of cycles in H; that 
contain it (0 for even, 1 for odd). 

We claim that this is a proper 4-face-coloring, as illustrated above. Faces 
F, F’ sharing an edge e are distinct faces, since G is 2-edge-connected. Edge e 
belongs to a cycle C in at least one of H;, H2 (in both if e has color b). By the 
Jordan Curve Theorem, one of F, F’ is inside C and the other is outside. All 
other cycles in H; and Hp fail to separate F and F’, leaving them on the same 
side. Hence if e has color a, c, or b, then the parity of the number of cycles 
containing F and F’ is different in H;, in Mo, or in both, respectively. Thus F 
and F’ receive different colors in the face-coloring we have constructed. | 


Due to this theorem, a proper 3-edge-coloring of a 3-regular graph is called a 
Tait coloring. The problem of showing that every 2-edge-connected 3-regular 
planar graph is 3-edge-colorable reduces to showing that every 3-connected 
3-regular planar graph is 3-edge-colorable. 


7.3.3.* Lemma. If G is a 3-regular graph with edge-connectivity 2, then G has 
subgraphs Gy, G2 and vertices uy, v1 € V(G1) and ug, ve € V(G2) such that 
U, # v4, also ug # ve, and G consists of G1, Gz and a ladder of some length 
joining Gy, G2 at 4, v4, U2, v2 as shown below. 


Proof: If G has an edge cut of size 2 in which the two edges are incident, 
then the third edge incident to their common vertex is a cut-edge, contradicting 
x’ = 2. Hence we may assume that the four endpoints in our minimum edge 
cut xy,uv are distinct. If x #4 y and u # v, then these are the four desired 
vertices and the ladder has only these two edges. . 

When x < y, we extend the ladder (a similar argument applies when u <> 
v). Let w be the third neighbor of x and z the third neighbor of y. If w = z, then 
the third edge incident to this vertex is a cut-edge. Hence w ¥ z and the ladder 
extends. If w # z, then we are finished in this direction; otherwise, we repeat 
the argument till we obtain a nonadjacent pair at the base of the ladder. | 


7.3.4.* Theorem. All 2-edge-connected 3-regular simple planar graphs are 3- 
edge-colorable if and only if all 3-connected 3-regular simple planar graphs 
are 3-edge-colorable. 
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Proof: The second family is contained in the first. Hence it suffices to show 
that 3-edge-colorability for all graphs in the smaller family implies it also for 
the larger family. We use induction on n(G). 

Basis step (n(G) = 4): The only 2-edge-connected 3-regular simple planar 
graph with at most 4 vertices is K4, which is 3-edge-colorable. 

Induction step (n(G) > 4): Since «(G) = «’(G) when G is 3-regular (The- 
orem 4.1.11), we may restrict our attention to 3-regular graphs with edge- 
connectivity 2. Lemma 7.3.3 gives us a decomposition of G into Gj, Go, anda 
ladder joining them. The length of the ladder is the distance from G, to Go. 

Both G1+u v1 and Gg+ugv2 are 2-edge-connected and 3-regular. By the in- 
duction hypothesis, they are 3-edge-colorable; let f; be a proper 3-edge-coloring 
of G; + u;v;. Permute names of colors so that f,(uiv) = 1 and so that fo(ugv2) 
is chosen from {1, 2} to have the same parity as the length of the ladder. 

Returning to G, color each G; asin f;. Beginning from the end of the ladder 
at Gj, color the rungs of the ladder with 3, and color the paths forming the sides 
of the ladder alternately with 1 and 2. The edges of the ladder at u; and v; now 
have the color f;(u;v;). Thus we have assembled a proper 3-edge-coloring of G. 

a 


Thus the!Four Color Theorem reduces to finding Tait colorings of 3-edge- 
connected 3-regular planar graphs. The statement of their existence was known 
as Tait’s conjecture and is equivalent to the Four Color Theorem. 


a 


GRINBERG’S THEOREM 


telegram.me/ 


Every Hamiltonian 3-regular graph has a Tait coloring (Exercise 1). Tait 
believed that this completed a proof of the Four Color Theorem, because he 
assumed that every 3-connected 3-regular planar graph is Hamiltonian. Not 
until 1946 was an explicit counterexample found, although the gap in the proof 
was noticed earlier. Later, Grinberg [1968] discovered a simple necessary con- 
dition that led to many 3-regular 3-connected non-Hamiltonian planar graphs, 
including the Grinberg graph of Exercise 16. 


7.3.5. Theorem. (Grinberg [1968]) If G is a loopless plane graph having a 
Hamiltonian cycle C, and G has f/ faces of length i inside C and /f;’ faces 
of length i outside C, then 0, (i — 2)(f; — f/) = 90. 


Proof: Considering the faces inside and outside C separately, we want to show 
that >0;(i — 2) f/ = >0,(i — 2) Ff’. No changes on one side affect the sum on the 
other side. Furthermore, we can switch inside and outside by projecting the 
embedding onto a sphere and puncturing a face inside C. 

Hence we need only show that )°;(i — 2) f/ is constant. When there are’no 
inside edges, the sum is n — 2. With this as the basis step, we prove by induction 
on the number of inside edges that the sum is always n — 2. 

Suppose that }°,(i — 2) f/ =n — 2 when there are k edges inside C. We can 
obtain any graph with k + 1 edges inside C by adding an edge to such a graph. 
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The added edge cuts a face of some length r into two faces of lengths s and t. 
We have s +t =r-+ 2, because the new edge contributes to both new faces and 
each edge on the old face contributes to one new face. 

No other contribution to the sum changes. Since (s — 2) + (t — 2) = (r — 2), 
the contribution from these faces also remains the same. By the induction 
hypothesis, the sum is n — 2. | 


Being a necessary condition, Grinberg’s condition can be used to show that 
graphs are not Hamiltonian. The arguments can often be simplified using 
modular arithmetic. Two numbers that are not congruent mod k are not equal. 

We apply this to the first known non-Hamiltonian 3-connected 3-regular 
planar graph (Tutte [1946]). Tutte used an ad hoc argument to prove that this 
graph is not Hamiltonian. For many years it was the only known example (see 
Exercise 17 for the smallest now known). 


7.3.6. Example. Grinberg’s condition and the Tutte graph. The Tutte graph G 
appears on the left below. Let H denote each component obtained by deleting 
the central vertex and the three long edges. Since a Hamiltonian cycle must 
visit the central vertex of G, it must traverse one copy of H along a Hamiltonian 
path joining the other entrances to H, which we call x and y. 

We therefore study a graph that has a Hamiltonian cycle if and only if H 
has a Hamiltonian x, y-path. Such a graph H’ (on the right below) is obtained 
by adding an x, y-path of length two through a new vertex. 


telegram 
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The plane graph H’ has five 5-faces, three 4-faces, and one 9-face. Grin- 
berg’s condition becomes 2a, + 3a5 + 7ag = 0, where a; = f/ — f;’. Since the 
unbounded face is always outside, the equation reduces mod 3 to 2a, = 7 mod 8. 
Since fj + f;' = 3, the possibilities for a, are +3, +1, —1, —3. The only choice 
satisfying 2a, = 7 mod 3 is ag = —1, which requires that two of the 4-faces lie 
outside the Hamiltonian cycle. However, the 4-faces having having a vertex of 
degree 2 cannot lie outside the cycle, since the edges incident to the vertex of 
degree 2 separate the face from the outside face. 

We can reach a contradiction faster by subdividing one edge incident to 
each vertex of degree 2. This does not change. the existence of a spanning cy- 
cle. The resulting graph ‘has seven 5-faces, one 4-face, and one 11-face. The 
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required equation becomes 2 - (+1) = 9 — 3as, which has no solution since the 
left side is not a multiple of 3. a 


We have not presented a systematic procedure for proving the nonexistence 
of solutions to equations with integer variables. Our arguments involving di- 
visibility are merely tricks to avoid listing cases, but such tricks often work. 

High connectivity makes it harder to avoid spanning cycles. Tutte [1956] 
(extended by Thomassen [1983]) proved that every 4-connected planar graph 
is Hamiltonian. Barnette [1969] conjectured that every planar 3-connected 
3-r°gular bipartite graph is Hamiltonian. 


SNARKS (optional) 


Another approach to the Four Color Theorem is to study which 3-regular 
graphs are 3-edge-colorable. In a discussion focusing on 3-regular graphs and 
graphs without cut-edges, it is convenient to have simple adjectives to descrihe 
these properties. 


7.3.7. Definition. A bridgeless graph is a graph without cut-edges. A cubic 
graph is a graph that is regular of degree 3. 


7.3.8. Conjecture. (3-edge-coloring Conjecture—Tutte [1967]) Every bridge- 
less cubic non-3-edge-colorable graph contains a subdivision of the Petersen 
graph. Ey 


Conjecture 7.3.8 has been proved! Like the Four Color Theorem, its 
computer-assisted proof uses discharging methods. The proof will appear in 
a series of five papers by Robertson, Sanders, Seymour, and Thomas [2001]. 

Since every subdivision of the Petersen graph is nonpianar, Conjecture 
7.3.8 implies Tait’s Conjecture and hence the Four Color Theorem. One natu- 
ral approach to the conjecture, like the idea of reducibility for the Four Color 
Theorem, is to derive properties that a minimal counterexample must have. In 
this language, Theorem 7.3.4 says that a minimal counterexample must be 3- 
edge-connected. In the next lemma, we make this statement precise and obtain. 
several other properties. 


7.3.9. Definition. A trivial edge cut is an edge cut whose deletion isolates a 
single vertex. Other edge cuts are nontrivial. 


7.3.10. Lemma. If a non-3-edge-colorable cubic graph G has connectivity 2 or 
girth less than 4 or a nontrivial 3-edge cut, then G contains a subdivision 
of a smaller non-3-edge-colorable cubic graph. 


Proof: Suppose first that G has an edge cut of size 2. As discussed in Lemma 
7.3.3, these edges have no common vertices. Deleting the edge cut and adding 
one edge to each piece yields cubic graphs G; + u,v; and G2 + ugve. As argued 
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in Theorem 7.3.4, at least one of these graphs is not 3-edge-colorable. Since 
the added edge can be replaced by a path through the other piece, G contains a 
subdivision of this smaller non-3-edge-colorable graph. 

Next suppose that G contains a triangle. Let G’ be the graph obtained from 
G by contracting the triangle to a single vertex. A proper 3-edge-coloring of G’ 
could be expanded into a proper 3-edge-coloring of G as shown below. Also, G 
contains a subdivision of G’, obtained by deleting one edge of the triangle. 

Suppose that G contains a 4-cycle but no triangle. Let G’ be the cubic graph 
obtained from G by deleting two opposite edges of the 4-cycle and replacing the 
resulting paths of length 3 with single edges. Since G has no triangle, the 
new edges are not loops. A proper 3-edge-coloring of G’ yields a proper 3-edge- 
coloring of G via the two cases shown below. Also G contains a subdivision of 
G', so G’ is the desired smaller graph. 
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Finally, suppose that G contains a nontrivial 3-edge cut [S, S]. Since we 
may assume that G is 3-edge-connected, the three edges of the cut are pairwise 
disjoint. The two graphs obtained by contracting G[S] or G[S] to a single vertex 
are also 3-regular. If both are 3-edge-colorable, then the colors can be renamed 
to agree on the edges of the cut, yielding a proper 3-edge-coloring of G. Thus at 
least one of these graphs is not 3- edge-colorable. 

It remains only to show that G contains a subdivision of G[S] (and similarly 
of G[S]). Let a, b, c be the endpoints i in 'S of the edges in the cut. Since G is 3- 
edge-connected, the cut is a bond, and G[S]} is connected (Proposition 4.1.15). 
Thus G[S] contains an a, b-path P and a path from c to P. Adding these paths 
and the edges of the cut to G[S] completes a subdivision of G[S]. a 


CHE )- CBE) 
7.3.11. Definition. A snark is a 2-edge-connected 3-regular graph that is not 


3-colorable, has girth at least 5, and has no non-trivial 3-edge cut. A prime 
snark is one that contains no subdivision of a smaller snark. 


In this language, we have reduced Tutte’s 3-edge-coloring Conjecture to the 
statement that the Petersen graph is the only prime snark. Again, we note that 
the conjecture has been proved (Robertson—Sanders—Seymour—Thomas [2001]). 

After the Petersen graph in 1898, by 1975 only three more snarks had 
been found: the 18-vertex Blanusa [1946] snark, the 210-vertex Descartes 
[1948] snark, and the 50-vertex Szekeres [1973] snark. This prompted Martin 
Gardner [1976] to invent the term “snark”, evoking the rarity of the creature 
in Lewis Carroll’s “The Hunting of the Snark”. 
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Isaacs [1975] then showed that the earlier snarks arise from the Petersen 
graph via an operation that generates infinite families of snarks. 


7.3.12. Definition. The dot product of cubic graphs G and H is the cubic 
graph formed from G+ H by deleting disjoint edges uv and wx from G, 
deleting adjacent vertices y and z from H, and adding edges from u and v 
to Ny(y) — {z} and from w and x to Ny(z) — {y}. 


AZ. 
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The dot product of two snarks is a snark (Exercise 23). Applying it to two 
copies of the Petersen graph yields the BlanuSa snark shown below. This graph 
has a non-trivial 4-edge cut. Kochol [1996] introduced a more general operation 
that yields snarks with large girth and higher connectedness properties. 


a 
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7.3.13. Example. The flower snarks. Isaacs also found an explicit infinite fam- 
ily of snarks (Exercise 21) that don’t arise via the dot product. Independently 
discovered by Grinberg, they have 4k vertices, for odd k > 5. 

Begin with three disjoint k-cycles. Let {x;}, {yi}, {z:} be the three vertex 
sets, indexed cyclically. For each i add a vertex w; with N(w;) = {x;, y;, zi}. 
The resulting graph G, is 3-edge-colorable. Let H, be the graph obtained by 
replacing the edges x,x; and y,y; with x,y; and y,x;. Ifk is odd and k > 5, then 
H, is a snark. If k is even, then H, is 3-edge-colorable. The drawing of H; in 
which {z;} is a central cycle suggests the name “flower snark”. a 
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FLOWS AND CYCLE COVERS (optional) 


Tait’s Theorem (Theorem 7.3.2) states that 3-edge-colorability and 4-face- 
colorability are equivalent for plane triangulations. When extending this be- 
yond planar graphs, we need a concept that makes sense for all graphs and 
is equivalent to 4-face-coloring on plane graphs. Additional information about 
this topic (and about snarks) appears in the monograph by Zhang [1997]. 


7.3.14, Definition. A flow on a graph G is a pair (D, f) such that 
1) D is an orientation of G, 
2) f is a weight function on E(G), and 
3) each v € V(G) satisfies ) yewt) f(UW) = Levens f (uv). 
A k-flow is an integer-valued flow such that | f(e)| < k —1 for all e € E(G). 
A flow is nowhere-zero or positive if f(e) is nonzero or positive, respec- 
tively, for all e € E(G). 


The usage of “flow” here is somewhat different from that in Chapter 4. In 
both contexts, the word “flow” suggests the conservation constraints imposed 
at each vertex. The bound of k — 1 on flow value evokes the notion of capacity. 

We can alter the orientation to make all weights positive. 


_boo 


7.3.15. Proposition. For a graph G, the following are equivalent: 
A) G has a positive k-flow. 
B) G has a nowhere-zero k-flow. 
C) G has a nowhere-zero k-flow for each orientation of G. 
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Proof: Simultaneously changing the orientation of an edge and the sign of its 
weight does not affect the conservation constraints. | 


Thus the existence of a nowhere-zero k-flow does not depend on the choice 
of the orientation. We can also take linear combinations of flows. 


7.3.16. Proposition. If (D, f1),...,(D, f,) are flows on G, and g = )j_, aff, 
then (D, g) is a flow on G. 


Proof: For each v € V(G), the net flow out of v under each f; is zero, and hence 
it is also zero under g. | 


7.3.17. Proposition. For a flow on G, the net flow out of any set S C V(G) is 
zero. Thus a graph with a nowhere-zero flow has no cut-edge. 


Proof: We sum the net flows out of vertices of S. Edges leaving S contribute 
with positive weight, edges entering S contribute with negative weight, and 
edges within S contribute positively at their tails and negatively at their heads. 
The net flow out of S is thus the sum of the net flows out of the vertices of S, 
which is zero. 

This implies that the net flow across any edge cut is zero, so it cannot 
consist of a single edge with nonzero weight. | 
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Thus we restrict our attention to graphs without cut-edges (bridgeless 
graphs). What distinguishes flows here from circulations in Section 4.3 is that 
we forbid zero as a weight. Nowhere-zero flows enable us to extend Tait’s Theo- 
rem. We begin by interpreting Eulerian graphs in the context of nowhere-zero 
flows; connectedness is no longer important. 


7.3.18. Definition. A graph is an even graph if every vertex has even degree. 


7.3.19. Proposition. A graph has a nowhere-zero 2-flow if and only if it is an 
even graph. 


Proof: Given a nowhere-zero 2-flow, we obtain a positive 2-flow. Since this as- 
signs weight 1 to every edge, the orientation must have as many edges entering 
each vertex as leaving it. Thus each vertex degree is even. 

Conversely, when each vertex degree is even, each component has an Eule- 
rian circuit. Orienting the edges to follow such a circuit and assigning weight 
1 to each edge yields a positive 2-flow. | 


Nowhere-zero 3-flows are more subtle, even for 3-regular graphs. 


7.3.20. Proposition. (Tutte [1949]) A cubic graph has a nowhere-zero 3-flow 
if and only if it is bipartite. > 


01 


Proof: Let G be a cubic X, Y-bigraph.<Every regular bipartite graph has a 1- 
factor. Orient the edges of a 1-factor from X to Y, and give them weight 2. 
Orient all other edges from Y to X, and give them weight 1. The fio in and 
out of every vertex is 2, so this is a nowhere-zero 3-flow. 

Conversely, let G be a cubic graph with a nowhere-zero 3-flow. By Propo- 
sition 7.3.15, we may assume that the flow is 1 or 2 on each edge. Since the 
net flow is 0, there must be one edge with flow 2 and two edges with flow 1 at 
each vertex. Thus the edges with flow 2 form a matching. The X be the set of 
tails and Y the set of heads of these edges. Since the net flow is 0 at each ver- 
tex, each edge with flow 2 points from X to Y, and each edge with flow 1 points 
from Y to X. Thus X, Y is a bipartition of G. | 


7.3.21. Example. Since the Petersen graph is cubic and not bipartite, it has no 
nowhere-zero 3-flow. We will see that it also has no nowhere-zero 4-flow. Below 
we show a nowhere-zero 4-fiow in the 3-regular simple graph C30 Ko. | 


To understand the duality between flows and colorings, we characterize 
the plane graphs with nowhere-zero k-flows. 
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7.3.22. Theorem. (Tutte [1954b]) A plane bridgeless graph is k-face-colorable 
if and only if it has a nowhere-zero k-flow. 


Proof: (Younger [1983], refined by Seymour) Let f be a flow on a plane graph 
G. We define a function g qn the set of faces by letting g(F) be the net flow 
accumulated by traveling from face F out to the unbounded face. Each time we 
cross an edge e we count + f(e) if e is directed toward our right, — f(e) if e is 
directed toward our left. The value assigned to the outside face is 0. 

The function g is well-defined; that is, g(F) is independent of our route 
to the outside face. We can change a route into any other by a succession of 
changes where we go the “other way” around some vertex v (shown on the left 
below). The change increases or decreases our accumulation for this portion by 
the net flow out of v, which is 0. Note that the difference between the values 
on faces with a common edge e is +f (e). , 


1_ books 


Conversely, given a function g defined on the faces, we can invert the pro- 
cess to obtain a flow (shown on the right above). As we stand on face F and 
look and face F’ across edge e, we let f(e) = g(F) — g(F’) ife is directed toward 
our right, f(e) = g(F’) — g(F) if e is directed toward our left. 

Thus flows correspond to face-colorings. The face-coloring is proper if and 
only if the flow is nowhere-zero. If the flow is a nowhere-zero k-flow, then re- 


ducing the labels in the coloring to congruence classes in {0,..., k — 1} produces 
a proper k-coloring. Conversely, a proper k-face-coloring using these colors pro- 
duces a nowhere-zero k-flow. | 


The correspondence between face-labelings and flows in Theorem 7.3.22 is 
valid when the labels come from any abelian group. Applied using the group 
of binary ordered pairs under addition ((0, 0) is the identity), the statement 
proved by this argument is precisely Tait’s Theorem itself. 

Since we can study flows on all graphs, we can consider the flow problem 
as a general dual notion to vertex coloring. “Nowhere-zero” is the analogue 
of “proper”. Since every nowhere-zero k-flow is a nowhere-zero k + 1-flow, the 
natural problem is to minimize k such that G has a nowhere-zero k-flow. This 
minimum is the flow number of G, by analogy with “chromatic number”. Since 
we say “G is k-colorable” when G has a proper k-coloring, the natural analogue 
would be to say “G is k-flowable” instead of “G has a nowhere-zero k-flow”. 
This language is not yet common, so we will use it sparingly. 

By Tait’s Theorem, Theorem 7.3.22 states that a cubic bridgeless planar 
graph is 3-edge-colorable if and only if it has a nowhere-zero 4-flow. We want 
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to extend this correspondence by dropping the condition on planarity. A simple 
observation about parity will be useful. 


7.3.23. Lemma. In a nowhere-zero k-flow, every vertex is incident to an even 
number of edges of odd weight. 


Proof: Since at each vertex the total weight on entering edges equals the total 
weight on exiting edges, the sum of the weights is even. | 


7.3.24. Theorem. Let G be a cubic graph. If G has a nowhere-zero 4-flow, then 
G is 3-edge-colorable. 


Proof: By Proposition 7.3.15, we may assume that G has a positive 4-flow 
(D, f), and thus f(e) € {1, 2, 3} for each edge e. By Lemma 7.3.23, each vertex 
is incident to exactly one edge of weight 2. Thus the edges of weight 2 form a 
1-factor in G, and deleting them leaves a union of disjoint cycles. To complete 
a 1-factorization, it suffices to show that each of these cycles has even length. 
Let C be such a cycle. The edges of weight 2 that are incident to vertices 
of C are chords or join V(C) with V(C). The chords occupy an even size subset 
of V(C). Thus it suffices to show that the number of edges between V(C) and 
V(C) is even. These edges all have weight 2. Since the net flow out of V(C) 
must be 0 and all edges between V(C) and V(C) have flow 2, the number of 
edges leaving V(C) must equal the number of edges entering it. | 


a 
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Since the Petersen graph is not 3-edge-colorable, Theorem 7.3.24 implies 
that it is not 4-flowable. Existence of nowhere-zero k-flows is preserved by 
subdivision: when an edge e of weight j in a nowhere-zero k-fiow is subdivided, 
replacing it with a path of length 2 oriented in the same direction with weight / 
on both edges yields a nowhere-zerok-flow in the new graph. Thus subdivisions 
of the Petersen graph also have no nowhere-zero 4-flows. 

The converse of Theorem 7.3.24 is true but not trivial, since it may not be 
possible to treat the color classes as edge sets of fixed weight and orient the 
graph to make this a 4-flow. In the graph C3 0 Kz of Example 7.3.21, there 
is essentially only one proper 3-edge-coloring, and when the color classes are 
labeled 1, 2,3 it is not possible to obtain a 4-flow. In the positive 4-flow in 
Example 7.3.21, the edges of weight 1 do not form a matching. 

Nevertheless, we can apply the next theorem to guarantee nowhere-zero 
4-flows in cubic graphs. The characterization is more general, since it does not 
require regularity. 


7.3.25. Theorem. A graph has a nowhere-zero 4-flow if and only if it is the 
union of two even graphs. 
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Proof: Let G:, Gz be even graphs with G = G; U G2. Let D be an orientation 
of G, restricting to D; on G;. By Proposition 7.3.19 and Proposition 7.3.15, G; 
has a nowhere-zero 2-flow (D;, f;). Extend f; to E(G) by letting f;(e) = 0 for 
e € E(G) — E(G;). Let f = f, + 2fo. This weight function is odd on E(G,) and 
is +2 on E(G) — E(G;), so itis nowhere-zero. Its magnitude is : lways at most 
3, and by Proposition 7.3.16 (D, f) is a flow; thus it is a nowhere-zero 4-flow. 
Conversely, let (D, f) be a nowhere-zero 4-flow on G. Let E, = {e € 
E(G): f(e) is odd}. By Lemma 7.3.23, E; forms an even subgraph of G. Thus 
there is a nowhere-zero 2-flow (D,, f,) on E;, where D; agrees with D. Extend 
fi to E(G) by letting fi(e) = 0 for e € E(G) — E 1; now (D, f;) is a 2-flow on G. 
Define fo on E(G) by fo = (f — fi)/2. By Proposition 7.3.16, (D, fo) is a 
flow on G. It is an integer flow, since f(e) — fi(e) is always even. By Lemma 
7.3.23, the set Eg = {e € E(G): fo(e) is odd} forms an even subgraph of G. For 
e € E(G) — Ej, we have fe) = +2 and fi(e) = 0, which yields fo(e) = +1, so 
E(G) — E, € Ez. Now G is the union of two even subgraphs. a 


7.3.26. Corollary. If G is a cubic graph, then G is 3-edge-colorable if and only 
if G has a nowhere-zero 4-flow. 


Proof: Every 3-edge-colorable cubic graph is the union of two even subgraphs: 
the edges of colors 1 and 2, and the edges of colors 1 and 3. w 


In light of Theorem 7.3.22, Corollary 7.3.26 generalizes Tait’s Theorem. 

We have seen that subdivisions of the Petersen graph are not 4-flowable. 
Among bridgeless graphs, Tutte a ais that excluding such subgraphs 
yields nowhere-zero 4-flows. 


elegra 


7.3.27. Conjecture. (Tutte’s 4-flow ee ae [1966b]) Every bridge- 
less graph containing no subdivision of the Petersen graph is 4-flowable. lm 


Since every graph containing a subdivision of the Petersen graph is nonpla- 
nar, Tutte’s 4-flow Conjecture implies the Four Color Theorem. Since nowhere- 
zero 4-flows are equivalent to 3-edge-colorings on cubic graphs, the 4-flow Con- 
jecture also implies the 3-edge-coloring Conjecture (which has been proved). 
Researchers have hoped for an elegant proof of Tutte’s 4-flow Conjecture as a 
way of obtaining a shorter proof of the Four Color Theorem. 

We close this section by describing of several other famous conjectures 
related to these. Every nowhere-zero k-flow is a nowhere-zero k + 1-flow, so con- 
ditions for nowhere-zero 3-flows or 5-flows should be more or less restrictive, 
respectively, than conditions for a nowhere-zero 4-flow. Statements of Tutte’s 
3-flow Conjecture appear in Steinberg [1976] and in Bondy—Murty [1976, Un- 
solved Problem 48]. 


7.3.28. Conjecture. (Tutte’s 3-flow Conjecture) Every 4-edge-connected graph 
has a nowhere-zero 3-flow. | 


7.3.29. Conjecture. (Tutte’s 5-flow Conjecture—Tutte [1954b]) Every bridge- 
less graph has a nowhere-zero 5-flow. a 
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Kilpatrick [1975] and Jaeger [1979] proved that every bridgeless graph is 
8-flowable. Seymour [1981] proved that these graphs are 6-flowable. We sketch 
the ideas of the 8-flow Theorem; details are requested in exercises. 

Both proofs reduce to the 3-edge-connected case, by showing that a small- 
est bridgeless graph without a nowhere-zero k-flow is simple, 2-connected, and 
3-edge-connected (Exercise 26). The main step is then to express a 3-edge- 
connected graph as a union of subgraphs with good flows. A generalization of 
Theorem 7.3.25 then applies: If G,; has a nowhere-zero k,-flow and G2 has a 
nowhere-zero k2-flow, then G; U Gg has a nowhere-zero k,k2-flow (Exercise 24). 
(The converse also holds but is not needed.) 

For the 8-flow Theorem, it then suffices to prove that a 3-edge-connected 
graph can be expressed as the union of three even subgraphs. First, adding an 
additional copy of each edge in G yields a 6-edge-connected graph G’. Then, the 
Tree—Packing Theorem of Nash-Williams (Corollary 8.2.59) yields three pair- 
wise edge-disjoint spanning trees in G’. These correspond to three spanning 
trees in G. Since we obtained them as edge-disjoint trees in G’, each edge of G 
appears in at most two of them. 

Within a spanning tree of G, one can find a parity subgraph of G, mean- 
ing a spanning subgraph H such that dy(v) = dg(v) mod 2 for all v € V(G) 
(Exercise 25). The complement within £(G) of the edge set of a parity subgraph 
is an even subgraph of G. Since our three spanning trees have no common edge, 
the complements of their parity subgraphs express G as a union of three even 
subgraphs. By Proposition 7.3.19, each has a nowhere-zero 2-flow, and hence 
G has a nowhere-zero 8-flow. : 

The approach in Seymour [1981] ig similar; the task is to express a 3-edge- 
connected graph as a union of an even graph and a 3-flowable graph. This 
uses more subtle concepts, including a notion of “modular” flows originally in- 
troduced by Tutte [1949]. Seymour’s proof was refined by Younger [1983] and 
Jaeger [1988]. We refer the reader to Zhang [1997] for an exposition. 

Celmins [1984] proved that if the 5-flow Conjecture is false, then the small- 
est counterexample is a snark having girth at least 7 and no nontrivial edge 
cut with four edges. 


im. 


We describe one additional conjecture and its relation to earlier topics. In 
a 2-edge-connected plane graph, all facial boundaries are cycles. Each edge 
lies in the boundary oi two faces, so the facial cycles together cover every edge 
exactly twice. It is reasonable to ask whether such a covering can be obtained 
also for graphs that are not planar. 


7.3.30. Definition. A cover of a graph G is a list of subgraphs whose union 
is G. A double cover is a cover with each edge appearing in exactly two 
subgraphs in the list. A cycle double cover (CDC) is a double cover 
consisting of cycles. 


7.3.31. Example. Together with the outer 5-cycle, the 5 rotations of the 5-cycle 
illustrated below form a CDC of the Petersen graph. The Petersen graph also 
has CDCs using cycles of other lengths (Exercise 36). a 
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Since cut-edges appear in no cycles, only bridgeless graphs have CDCs. 


7.3.32. Conjecture. (Cycle Double Cover Conjecture—Szekeres [1973], Sey- 
mour [1979b]) Every bridgeless graph has a cycle double cover. | 


One might think that the CDC Conjecture follows immediately using em- 
beddings on surfaces with handles, but such embeddings may have facial bound- 
aries that traverse the same edge twice. The Strong Embedding Conjecture 
asserts that every 2-connected graph has an embedding (on some surface) in 
which the boundary of each face is a single cycle. Applying this to each block of 
a 2-edge-connected graph would yield the CDC Conjecture. 

In discussing the CDC, we must alert the reader to an unfortunate conflict 
in terminology. Throughout this book, we use the definition of cycle that is 
common in discussing connectivity, girth, circumference, planarity, etc. In this 
language, a circuit is an equivalence class of closed trails (ignoring the starting 
vertex), and an even graph is a graph whose vertex degrees are all even. A 
circuit traverses a connected even graph. 

The literature on cycle covers generally reverses this terminology, using 
“circuit” to mean what we call a cycle and “cycle” to mean what we call an even 
graph. Since the term “even graph” strongly evokes its definition, we hope that 
our usage will be clear. 

The alternative usage arises from other contexts. In a matroid (Section 
8.2), the circuits are the minimal dependent sets, and in the cycle matroid of a 
graph these are the edge sets of the cycles. The cycle space of a graph is a vector 
space (using scalars {0, 1}) where the coordinates are indexed by the edges and 
the vectors correspond to the even subgraphs. 

The original CDC Conjecture states that every bridgeless graph has a dou- 
ble cover by even subgraphs. That phrasing is equivalent to ours, since every 
even graph is an edge-disjoint union of cycles. 

Thus we might seek a double cover by using a small number of even sub- 
graphs. The cycles in a cycle double cover are even subgraphs; when cycles are 
pairwise edge-disjoint, they can be combined to form a single even subgraph. 
This leads to the connection between integer flows and cycle double covers. 


7.3.33. Proposition. A graph has a nowhere-zero 4-flow if and only if it has a 
cycle double cover forming three even subgraphs. 
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Proof: Theorem 7.3.25 states that a graph has a nowhere-zero 4-flow if and 
only if it is the union of two even subgraphs £, Eg. Let E3 = E,;aE>. At each 
vertex v the degree in £3 is the sum of the degrees in FE; and E2 minus twice 
the number of common incident edges; hence it is even. Hence E3 is an even 
subgraph, and it contains precisely the edges that appear in just one of {F1, E4}. 
Cycle décompositions of E,, Ez, E3; thus combine to yield a CDC. 

Conversely, if a CDC forms three even subgraphs, then omitting one of 
them leaves the graph expressed as the union of two even subgraphs, and hence 
a nowhere-zero 4-flow exists. a 


Let P denote the family of graphs that do not contain a subdivision of the 
Petersen graph. By Proposition 7.3.33, Tutte’s 4-flow Conjecture implies that 
every graph in P has a CDC. Alspach—-Goddyn-—Zhang [1994] proved a deep re- 
sult that yields cycle double covers for graphs in P. (They proved that a stronger 
covering property holds for G if and only if G € P.) In light of Proposition 7.3.33, 
this is a partial result toward Tutte’s 4-flow Conjecture. 

The CDC Conjecture is also related to snarks. Goddyn [1985] proved that 
if the CDC Conjecture is false, then the smallest counterexample is a snark 
with girth at least 8. 


EXERCISES 
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7.3.1. (—) Prove that every Hamiltonian 3-regular graph has a Tait coloring. 


7.3.2. (—) Exhibit 3-regular simple graphs with the following properties. 
a) Planar but not 3-edge-colorable. 
b) 2-connected but not 3-edge-colorable. 
c) Planar with connectivity 2, but not Hamiltonian. 


7.3.3. Prove that every maximal plane graph other than K, is 3-face-colorable. 


7.3.4. Without using the Four Color Theorem, prove that every Hamiltonian plane 
graph is 4-face-colorable (nothing is assumed about the vertex degrees). 


7.3.5. Prove that a 2-edge-connected plane graph is 2-face-colorable if and only if it is 
Eulerian. 


7.3.6. Use Tait’s Theorem (Theorem 7.3.2) to prove that x'(G) = 3 for the graph G below. 
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7.3.7. (!) Let G be a plane triangulation. 
a) Prove that the dual G* has a 2-factor. 
b) Use part (a) to prove that the vertices of G can be 2-colored so that every face 
has vertices of both colors. (Hint: Use the idea in the proof of Theorem 7.3.2.) (Burstein 
[1974], Penaud [1975]) 


7.3.8. (+) It has been conjectured that every planar triangulation has edge-chromatic 
number A(G), and this has been proved when A(G) is high enough. Show that x’(G) = 
A(G) for the graph of the icosahedron, illustrated below. 


7.3.9. Prove that a proper 4-coloring of the icosahedron uses each color exactly 5 times. 


7.3.10. Whitney [1931] proved that every 4-connected planar triangulation is Hamilto- 
nian. Use this to reduce the Four Color Problem to the problem of proving that every 
Hamiltonian planar graph is 4-colorable. 


mat 


7.3.11. Find a 5-connected planar graph. Does there exist a 6-connected planar graph? 


7.3.12. Let G be a planar graph with at least three faces. Prove that G has a vertex par- 
tition into two sets whose induced subgraphs are trees if and only if G* is Hamiltonian. 


7.3.13. (!) For each of the planar graphs below, present a Hamiltonian cycle or use 
planarity (Grinberg’s condition) to prove that it is non-Hamiltonian. 


7.3.14. Let G be the graph below. Prove that G has no Hamiltonian cycle. Explain why 
Grinberg’s Theorem cannot be used directly to prove this. 
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7.3.15. (!) Prove Grinberg’s Theorem using Euler’s Formula. 


7.3.16. (!) Use Grinberg’s condition to prove that the Grinberg graph (below) is not 
Hamiltonian. 


7.3.17. (!) The smallest known 3-regular 3-connected planar graph that is not Hamil- 
tonian has 38 vertices and appears below. Prove that this graph is not Hamiltonian. 
(Lederberg [1966], Bosdk [1966], Barnette): 


7.3.18. Let G be the grid graph P,,O P,. Let Q be a Hamiltonian path from the upper 
left corner vertex to the lower right corner vertex, such as that shown in bold below. 
Note that Q partitions the grid into regions, of which some open to the left or downward 
and others open to the right or upward. Prove that the total area of the up-right regions 
(B) equals the total area of the down-left regions (A). (Fisher—Collins—Krompart [1994]) 


7.3.19. (!) The generalized Petersen graph P(n,k) is the graph with vertices 
{uy,...,Un} and {v,,..., v,} and edges {u;uj41}, {u;v;}, and {v;v;.,}, where addition is 
modulo n. The Petersen graph itself is P(5, 2). 

a) Prove that the subgraph of P(n, 2) induced by k consecutive pairs {u;, v;} has a 
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spanning cycle if k = 1 mod 3 and k > 4. 
b) Use part (a) to prove that x'(P(n, 2)) = 3 ifn > 6. 
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7.3.20. (—) Let G be a 3-regular graph. Prove that if G is the union of three cycles, then 
G is 3-edge-colorable. 


7.3.21. (+) “Flower snarks”. Let G, and H, be as constructed in (Example 7.3.13). 
a) Prove that G, is 3-edge-colorable. 
b) Prove that H, is not 3-edge-colorable when k is odd. (Isaacs [1975]) 


7.3.22. Prove that every edge cut of K, OC, that does not isolate a vertex has at least 
2k edges. 


7.3.23. (*) Prove that applying the dot product operation (Definition 7.3.12) to two 
snarks yields a third snark. (Isaacs [1975]) 


7.3.24. (x!) Let G; and Gz be graphs. Prove that if G,; has a nowhere-zero k,-flow and 
G2 has a nowhere-zero k2-flow, then G, U Gz has a nowhere-zero k,k2-flow. 


7.3.25. (!) A parity subgraph of G is a subgraph H such that dy (v) = dg(v) mod 2 for 
all v € V(G). Prove that every spanning tree of a connected graph G contains a partity 
subgraph of G. (Itai-Rodeh [1978]) 
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7.3.26. (*) For k >. 3, prove that a smallest-nontrivial 2-edge-connected graph G having 
no nowhere-zero k-flow must be simple, 2-connected, and 3-edge-connected. (Hint: First 
exclude loops and vertices of degree 2 and: reduce to consideration of blocks. Then ex- 
clude multiple edges and finally edge cuts of size 2. In each case, compare G to a graph 
obtained from it by deleting or contracting edges.) 


7.3.27. (*) Prove that every Hamiltonian graph has a nowhere-zero 4-flow. 


7.3.28. («) Prove that every bridgeless graph with a Hamiltonian path has a nowhere- 
zero 5-flow. (Jaeger [1978]) 


7.3.29. (*) Embed Kg on the torus, and let G be the dual graph. Find a nowhere-zero 
5-flow'on G. 


7.3.30. («) Prove that a graph G is the union of r even subgraphs if and only if G has a 
nowhere-zero 2’-flow. (Matthews [1978]) 


7.3.31. (x) Let G be a graph having a cycle double cover forming 2’ even subgraphs. 
Prove that G has a nowhere-zero 2’-flow. (Jaeger [1988]) 


7.3.32. («!) A modular 3-orientation of a graph G is an orientation D such that 
di(v) = dp5(v) mod 3 for all v € V(G). Prove that a bridgeless graph has a nowhere-zero 
3-flow if and only if it has a modular 3-orientation. (Steinberg—Younger [1989]) 


7.3.33. («) Characterization of nowhere-zero k-flows. Let G be a bridgeless graph, let 
D be an orientation of G, and let a and b be positive integers. Prove that the following 
statements are co (Hoffman [1958]) 

a) 4 | [S, 3] < 


b= =1s s]| —a 2 for every nonempty proper vertex subset Ss. 
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b) G has an integer flow using weights in the interval [a, 5]. 
c) G has a real-valued flow using weights in the interval [a, 5]. 


7.3.34. (*) Find cycle double covers for the graphs C,, V Ki, Cm V 2Ki, and Cy V Ko. 


7.3.35. (*) Find the cycle double covers with fewest cycles for every 3-regular simple 
graph with 6 vertices. 


7.3.36. (*—) Let G be the Petersen graph. Find a cycle double cover of G whose elements 
are not all 5-cycles. Find a double cover of G consisting of 1-factors. (Hint: Consider the 
drawing of G having a 9-cycle on the “outside”. Comment: Fulkerson [1971] conjectured 
that every bridgeless cubic graph has a double cover consisting of 6 perfect matchings.) 


7.3.37. (*) Prove that any two 6-cycles in the Petersen graph must have at least two 
common edges. Conclude that the Petersen graph has no CDC consisting of five 6-cycles. 
Use this and Exercise 7.3.20 to conclude that the Petersen graph has no CDC consisting 
of even cycles. (C.Q. Zhang) 


7.3.38. (x!) A cycle double cover is orientable if its cycles can be oriented as directed 
cycles so that for each edge, the two cycles containing it traverse it in opposite directions. 
A digraph is even if d~(v) = d*(v) for each vertex v. 

a) Suppose that G has a nonnegative k-flow (D, f). Prove that f can be expressed 
as pees , Ji, where each (D, f;) is a nonnegative 2-flow on G. (Hint: Use induction on k.) 
(Little-Tutte—-Younger [1988] 

b) Prove that a graph G has a positive k- flow (D, f) if-and only if D is the union of 
k — 1 even digraphs such that each edge e in D appears in exactly f(e) of them. (Little— 
Tutte—Younger [1988]) 

c) Prove that a graph G hasa ee 3-flow if and only if it has an orientable 
cycle double cover forming three even subgraphs. (Tutte [1949]) 


7.3.39. («) Let G be a graph having a CDC formed from four even subgraphs. Prove that 
G also has a CDC formed from three even subgraphs. (Hint: Use symmetric differences.) 


7.3.40. (*) In the Petersen graph, prove that the solution to the Chinese Postman Prob- 
lem has total length 20, but the minimum total length of cycles covering the Petersen 
graph is 21. 


7.3.41. (*) Let M be a perfect matching in the Petersen graph. Prove that there is no 
list of cycles in the Petersen graph that together cover every edge of M exactly twice and 
all other edges exactly once. (Itai—-Rodeh [1978], Seymour [1979b]) 


7.3.42. (*) Let G be a graph in which a shortest covering walk (that is, an optimal 
solution to the Chinese Postman Problem) decomposes into cycles. Prove that G has a 
cycle cover of total length at most e(G) +n(G) — 1. Determine the minimum length of a 
cycle cover of K3, in terms of the number of edges and vertices. 
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Additional Topics 


In this chapter we explore more advanced or specialized material. Each 
section gives a glimpse of a topic that deserves its own chapter (or book). Sev- 
eral sections treat more difficult material near the end. 


8.1. Perfect Graphs 


We have discussed the lower bound x(G) > w(G) for chromatic number; 
the vertices of a clique need different colors. In Section 5.3, we discussed graphs 
whose induced subgraphs all achieve equality in this bound. 


8.1.1. Definition. A graph G is perfect if x(H) = w(H) for every induced 
subgraph H of G. 


When discussing perfect graphs, it is common ta use stable set to mean 
an independent set of vertices. As before, a clique is a set of pairwise adjacent 
vertices. As usual, maximum means maximum- -sized. 

Since we focus on vertex coloring, % again in this section we restrict our at- 
tention to simple graphs. Complementation converts cliques to stable sets and 
vice versa, so w(H) = a(H). Properly coloring H means expressing V(H) as 
a union of cliques in H; such a set of cliques in H is a clique covering of H. 
Thus for every graph G we have four optimization parameters of interest. 


independence number’ a(G) max size ofa stable set 
clique number w(G) max size of a clique 
chromatic number x(G) min size of a coloring 
clique covering number 6(G) min size of a clique covering 


Berge actually defined two types of perfection: 


G is y-perfect if x(G[A]) = w(G[A]) for all A C V(G). 
G is a-perfect if 6(G[A]) = a(G[A]) for all A € V(G). 
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Our definition of perfect is the same as this definition of y-perfect (Berge 
used y(G) for chromatic number). Since G[A] is the complement of G[A], the 
definition of a-perfect can be stated in terms of G as “x(G [A]) = w(G[A]) for all 
AC V(G)”. Thus “G is a-perfect” has the same meaning as “G is y-perfect”. 

We now use only one definition of perfection, because Lovasz [1972a] proved 
“G is y-perfect if and only if G is a-perfect”. In terms of our original defini- 
tion of perfection, this becomes “G is perfect if and only if G is perfect”. This 
statement is the Perfect Graph Theorem (PGT). 

Always x(G) > w(G) and 6(G) > a(G), since a clique and a stable set share 
at most one vertex. A statement of perfection for a class of graphs is thus an 
integral min-max relation. We observed in Example 5.3.21 that several familiar 
min-max relations are statements that bipartite graphs, their line graphs, and 
the complements of such graphs are perfect. 

If k > 2, then x(Cox41) > @(Cox41) and x(Cax41) > @(Cox41) (Exercise 1). 
Thus odd cycles and their complements (except C3 and C3) are imperfect. 


8.1.2. Conjecture. (Strong Perfect Graph Conjecture (SPGC)—Berge 
[1960]) A graph G is perfect if and only if both G and G have no induced 
subgraph that is an odd cycle of length at least 5. | 
The SPGC remains open. Since ‘the condition in the conjecture is self- 

complementary, the SPGC implies the-PGT. 

Having presented several classical families of perfect graphs in Section 
5.3, our goal now is to prove the Perfect Graph Theorem. Later we also study 
properties of minimal imperfect graphs and classes of perfect graphs. For fur- 
ther reading, Golumbic [1980] provides a thorough introduction to the subject. 
Berge—ChvAtal [1984] collects and updates many of the classical papers. 


THE PERFECT GRAPH THEOREM 


In 1960, Berge conjectured that )-perfection and 'a-perfection are equiva- 
lent (see Berge [1961]). Lovdsz [1972a] stunned the world of combinatorics by 
proving this important and well-known conjecture at the age of 22. Fulkerson 
also studied it, reducing it to a statement he thought was too strong to be true. 
When Berge told him that Lovasz had proved it, within hours he proved the 
missing lemma (Lemma 8.1.4), thus illustrating that a theorem becomes easier 
to prove when known to be true (Fulkerson [1971]). 

We will prove the Perfect Graph Theorem using an operation that enlarges 
a graph without affecting the property of perfection. 


8.1.3. Definition. Duplicating a vertex x of G produces a new graph Gox 
by adding a vertex x’ with N(x’) = N(x). The vertex multiplication of G 
by the nonnegative integer vector h = (hi,...,h,) is the graph H = Goh 
whose vertex set consists of h; copies of each x; € V(G), with copies of x; 
and x; adjacent in H if and only if x; + x; in G. 
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X4 X3 


X5 X1 


G Goxy G o (2, 1, 0, 3, 1) 


8.1.4. Lemma. Vertex multiplication preserves y-perfection and a-perfection. 


Proof: We first observe that G oh can be obtained from an induced subgraph 
of G by successive vertex duplications. If every h; is 0 or 1, then Goh = G[A], 
where A = {i: h; > 0}. Otherwise, start with G[A] and perform duplications 
until there are h; copies of x; (for each i). Each vertex duplication preserves the 
property that copies of x; and x; are adjacent if and only if x;x; € E(G), so the 
resulting graph is G oh. 

If G is a-perfect but Goh is not, then some operation in the creation of Goh 
from G[A] produces a graph that is not a-perfect from an a-perfect graph. It 
thus suffices to prove that vertex duplication preserves a-perfection. The same 
reduction holds for y-perfection. Since every proper induced subgraph of G o x 
is an induced subgraph of G or a vertex duplication of an induced subgraph of 
G, we further reduce our claim to showing that y(G o x) = w(G ox) when G is 
y-perfect and that a(G ox) = 6(Gox) when G is a-perfect. 

When G is y-perfect, we extend a proper coloring of G to a proper coloring 
of G o x by giving x’ the same color as-x. No clique contains both x and x’, so 
w(G ox) = w(G). Hence x(G ox) = x(G@) = w(G) = wo(G ox). 

When G is a-perfect, we consider:two cases. If x belongs to a maximum 
stable set in G, then adding x’ to it yields a(G ox) = a(G) +1. Since 6(G) = 
a(G), we can obtain a clique covering of this size by adding x’ as a 1-vertex 
clique to some set of 6(G) cliques covering G. 

If x belongs to no maximum stable set in G, then a(G o x) = a(G). Let Q 
be the clique containing x in a minimum clique cover of G. Since 6(G) = a(G), 
Q intersects every maximum stable set in G. Since x belongs to no maximum 
stable set, Q’ = Q — x also intersects every maximum stable set. This yields 
a(G — Q’) = a(G) — 1. Applying the a-perfection of G to the induced subgraph 
G— Q’ (which contains x) yields 6(G — Q’) = a(G — Q’). Adding Q’U {x’} to a set 
of a(G) — 1 cliques covering G — Q’ yields a set of a(G) cliques covering G ox. lJ 


< maximum 


< stable sets 


CY> 


Te ee 
clique covering of 
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8.1.5. Lemma. In a minimal imperfect graph, no stable set intersects every 
maximum clique. 


Proof: If a stable set S in G intersects every w(G)-clique, then perfection of 
G — S yields x(G — S) = w(G — S) = w(G) — 1, and S completes a proper oy 
coloring of G. This makes G perfect. 


8.1.6. Theorem. (The Perfect Graph Theorem (PGT) - Lovasz [1972a, 1972b]) 
A graph is perfect if and only if its complement is perfect. 


Proof: It suffices to show that a-perfection of G implies y-perfection of G; apply- 
ing this to G yields the converse. If the claim fails, then we consider a minimal 
graph G that is a-perfect but nut y-perfect. By Lemma 8.1.5, we may assume 
that every maximal stable set S in G misses some maximum clique Q(S). 

We design a special vertex multiplication of G. Let S = {5;} be the list of 
maximal stable sets of G. We weight each vertex by its frequency in {Q(S;)}, 
letting h; be the number of stable sets S; ¢ S such that x; € Q(S;). By Lemma 
8.1.4, H = Goh isa-perfect, yielding a(H) = 9(H). We use counting arguments 
for a(H) and 6(#) to obtain a contradiction. 

Let A be the 0,1-matrix of the ixcidence relation between {Q(S;)} and V(G); 
we have a;,; = 1 if and only if x; € Q(S;). By construction, h; . is the number of 
1s in column j of A, and n(H) is the total number of 1s in A. Since each row has 

‘w(G) 1s, alson(H) = w(G) |S|. Since vertex duplication cannot enlarge cliques, 
we have w(H) = w(G). Therefore, 0(H) > > n(H)/w(A) = |S. 

We obtain a contradiction by proving that a(H) < |S|. Every stable set in 
H consists of copies of elements in some stable set of G, so a maximum stable 
set in H consists of all copies of all vertices in some maximal stable set of G. 
Hence a(H) = maxres ) ,,. ger h;. The sum counts the 1s in A that appear in 
the columns indexed by 7. If we count these 1s instead by rows, we obtain 
a(H) = maxres )\5-5|T 9 Q(S)|. Since T is a stable set, it has at most one 
vertex in each chosen clique Q(S). Furthermore, T is disjoint from Q(T). With 
[TM Q(S)| < 1 for all S € S, and |7 N Q(T)| = 0, we have a(#) < |S] — 1. | 


V(G) 


8.1.7.* Remark. Linear optimization and duality. Clique-vertex incidence 
matrices also arise in expressing a and @ as integer optimization problems. A 
linear (maximization) program can be written as “maximize c - x over nonnega- 
tive vectors x such that Ax < b”, where A is a matrix, b, c are vectors, and each 
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row of Ax < bis a linear constraint a; - x < b; on the vector x of variables. A 
vector x satisfying all the constraints is a feasible solution. 

An integer linear program requires that each x; also be an integer. Let 
A be the incidence matrix between maximal cliques and vertices in a graph G; 
we have a;,; = 1 when v; € Q;. By definition, a(G) is the solutio1 to “max 1, - x 
such that Ax < 1,,” when the variables are required to be nonnegative integers. 
In the solution, x; is 1 or 0 depending on whether v, is in the maximum stable 
set; the constraints prevent choosing adjacent vertices. Similarly, when B is 
the incidence matrix between maximal stable sets and vertices, w(G) is the 
solution to “max 1, -x such that Bx < 1,” with integer variables. 

Every maximization program has a dual minimization program. When 
the max program is “maxc- x such that Ax < b”, the dual is “min y - b such 
that y’A > c”. This program has a variable y; for each original constraint and 
a constraint for each original variable x;, and it interchanges c, max, < with 
b,min, >. When stated in this form, the variables in both programs must be 
nonnegative. The integer programs dual to w and a seek the minimum number 
of stable sets that cover the vertices and the minimum number of cliques that 
cover the vertices, respectively; this describes x(G) and 6(G). 

Using the nonnegativity of the variables, the constraints yield 


cx <y™Ax <y-b. 


The statement “c- x < y-b” for feasible solutions x, y is weak duality. The 
(strong) Duality Theorem of Linear Programming states that dual pro- 
grams having feasible solutions have optimal solutions with the same value 
when integer solutions are not required. 

The statements xy > w and 6 > a@ are statements of weak duality for dual 
pairs of linear programs. A guarantee of strong duality using solutions that 
have only integer values is a combinatorial min-max relation. We have pre- 
sented many such relations and observed that they guarantee quick proofs of 
optimality. They also often lead to fast algorithms for finding optimal solutions, 
which is one motivation for studying families of perfect graphs. a 


8.1.8.* Example. Fractional solutions for an imperfect graph. For the 5-cycle, 
the linear programs for w, x,a,9 all have optimal value 5/2. There are five 
maximal cliques and five maximal stable sets, each of size 2. Setting each 
x; = 1/2 gives weight 1 to each clique and stable set, thereby satisfying the 
constraints for either maximization problem. Setting each y; = 1/2 in the dual 
programs covers each vertex with a total weight of 1, so again the constraints 
are satisfied. These programs have no optimal solution in integers, and the 
integer programs have a “duality gap”: x =3>2=mwand@d@=3>2=o. U5 


CHORDAL GRAPHS REVISITED 


Like trees, the more general class of chordal graphs has many character- 
izations. The definition by forbidding chordless cycles is a forbidden sub- 
structure characterization. A finite list of forbidden substructures such as 
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induced subgraphs yields a fast algorithm for testing membership in the class, 
but for chordal graphs the list is infinite and other methods are needed. 

A chordal graph can be built from a single vertex by iteratively adding a 
vertex joined to a clique; this is the reverse of a simplicial elimination order- 
ing, and we have seen that greedy coloring with respect to such a construction 
ordering yields an optimal coloring. Many classes of perfect graphs have such 
a construction procedure that produces the graphs in the class and no oth- 
ers. A construction procedure or the reverse decomposition procedure may 
lead to fast algorithms for computations on graphs in the class. 

Next we consider another type of characterization. 


8.1.9. Definition. An intersection representation of a graph G is a family 
of sets {S,: v € V(G)} such that u < v if and only if $, 0S, 4 @. If {S,} is an 
intersection representation of G, then G is the intersection graph of {S,}. 


Interval graphs are the graphs having intersection representations where 
each set in the family is an interval on the real line. Line graphs also form an 
intersection class; the allowed sets are pairs of natural numbers, corresponding 
to edges of the graph H such that G = L(H). An intersection characterization 
for chordal graphs was found Eee gwenny by Walter [1972, 1978], Gavril 
[1974], and Buneman [1974]. 


h_book 


8.1.10. Lemma. Ii 7;,..., 7; are pairwise intersecting subtrees of a tree T, 
then there is a vertex belonging to-all of T),..., Tx. 


Proof: (Lehel) We prove the contrapositive. If each vertex v misses some T (v) 
among 7),..., 7;, we mark the edge that leaves v on the unique path to T(v). 
If T has n vertices, then we make n marks, so some edge uw has been marked 
twice. Now T(u) and T(w) have no common vertex. a 


8.1.11. Theorem. A graph is chordal if and only if it has an intersection rep- 
resentation using subtrees of a tree (a subtree representation). 


Proof: We prove that the condition is equivalent to the existence of a simplicial 
elimination ordering. We use induction, with trivial basis K,. 

Let v;,..., v, be a simplicial elimination ordering for G. Since ve, ..., v, is 
a simplicial elimination ordering for G — v;, the induction hypothesis yields a 
subtree representation of G — v; in a host tree T. Since v, is simplicial in G, the 
set S = Ng(vj) induces a clique in G — v;. Therefore, the subtrees of T assigned 
to vertices of S are pairwise intersecting. 

By Lemma 8.1.10, these subtrees have a common vertex x. We enlarge T 
to a tree T’ by adding a leaf y adjacent to x, and we add the edge xy to the 
subtrees representing vertices of S. We represent v; by the subtree consisting 
only of y. This completes a subtree representation of G in T’. 

Conversely, let T be a smallest host tree for a subtree representation of G, 
with each v € V(G) represented by T(v) C T. If xy € E(T), then G must have 
a vertex u such that T(u) contains x but not y; otherwise, contracting xy into y 
would yield a representation in a smaller tree. 
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Let x be a leaf of T, and let u be a vertex of G such that T(u) contains x 
but not its neighbor. The subtrees for neighbors of u in G must contain x and 
hence are pairwise intersecting. Thus u is simplicial in G. Deleting T(u) yields 
a subtree representation of G — u. We complete a simplicial elimination order 
of G using such an ordering of G — u given by the induction hypothesis. a 


b \/\ bcd deh 


Because the class of chordal graphs is hereditary, a simplicial elimination 
ordering can start with any simplicial vertex. Thus a brute-force approach to 
finding such an ordering would be to examine neighborhoods until we find a 
simplicial vertex, delete it, and iterate. 

Rose—Tarjan—Lueker [1976] found a faster way, which was simplified fur- 
ther by Tarjan [1976]. The idea here, because there is always a simplicial vertex 
among the vertices farthest from a given vertex (proof of Theorem 5.3.17), is 
that a simplicial elimination ordering-can end at any vertex. Thus we start 
with an arbitrary vertex and list the vertices clumped around it. The result 
is a simplicial construction ordering (the reverse of a simplicial elimination or- 
dering) if and only if the graph is chordal. The algorithm was published with 
several applications in Tarjan—Yannakakis [1984]; we follow Golumbic [1984]. 


2.1.12. Algorithm. Maximum Cardinality Search (MCS) 

Input: A graph G. 

Output: A vertex numbering - a bijection f : V(G) > {1,...,n(G)}. 

Idea: For each unnumbered vertex v, maintain a label /(v) that is its degree 
among the vertices already numbered. The vertices at the end of a simplicial 
elimination ordering are those clumped around the last vertex, so in a simplicial 
construction orderiug the vertices with high labels should be added first. 
Initialization: Assign label 0 to every vertex. Set i = 1. 

Iteration: Select any unnumbered vertex with maximum label. Number it i 
and add 1 to the label of its neighbors. Augment i and iterate. | 


8.1.13. Example. The first vertex chosen in the MCS order is arbitrary. An 
application of MCS to the graph G above could start by setting f(c) = 1 and 
hence /(b) = I(d) = 1(e) = 1. Next we could select f(e) = 2 and update /(d) = 2, 
l(h) = l(u) = 1. Now d is the only vertex with label as large as 2, and hence 
f(d) = 3. We update /(b) = I(h) = I(u) = 2, l(g) = 1, l(a) = 0. Continuing 
the procedure can produce the order c, e, d, b, h, g, a, u in increasing order of f. 
This is a simplicial construction ordering, and u, a, g,h, b, d, e,c is a simplicial 
elimination ordering. | 
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8.1.14. Theorem. (Tarjan [1976]). A simple graph G is chordal if and only 
if the numbering v,..., v, produced by the Maximum Cardinality Search 
algorithm is a simplicial construction ordering of G. 


Proof: If MCS produces a simplicial construction ordering, then G is chordal. 
Conversely, suppose that G is chordal, and let f: V(G) — [n] be the numbering 
produced by MCS. A bridge of f is a chordless path of length at least 2 whose 
lowest numbers occur at the endpoints. We prove first that f has no bridge. 
Otherwise, let P = u, v1,..., 4%, w be a bridge that minimizes max{f(u), f(w)}. 
By symmetry, we may assume that f(u) > f(w) (f is used as the vertical 
coordinate to position vertices in the illustration). 

Since u is numbered in preference to uy, at time f(u), and w is already num- 
bered at that time, there exists a vertex x € N(u) — N(v;,) with f(x) < f(u). 
Letting vo = u, set r = max{j: x = v,}. The path P’ = x, v,,..., u,, w is chord- 
less, since x < w would complete a chordless cycle. Since both of f(x), f(w) 
are less than f(u), P’ is a bridge that contradicts the choice of P. Hence f has 
no bridge. 

With this claim, the proof follows by induction on n(G). It suffices to show 
that v, is simplicial, since the application of MCS to G — v, produces the same 
numbering v;,..., v,-1 that leaves v, at the end. If v, is not simplicial, then v, 
has nonadjacent neighbors u, w, in which case u, v,, w is a bridge of f. a 


Un 
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The MCS algorithm runs in time O(n(G) + e(G)) with careful implementa- 
tion. For each j, we maintain a doubly linked list of the vertices with label j. 
For each vertex we store its label and pointers to its neighbors and to its posi- 
tion in the lists. When v is numbered, in time O(1-+d(v)) we remove v from its 
list, augment its neighbors labels, and move its neighbors into the next higher 
lists. To complete the chordality test, we must also check whether the MCS or- 
der is a simplicial construction ordering (Exercise 10). Simplicial elimination 
or construction orderings quickly yield optimal colorings, cliques, stable sets, 
and clique coverings (Exercise 9). 

The alternative algorithm found by Rose, Tarjan, and Leuker is known 
as Lexicographic Breadth First Search (LBFS). Closely related to the proof of 
Theorem 5.3.17, LBFS has been used for many applications in testing graph 
properties and computing graph parameters. Corneil—Olariu—Stewart [2000] 
provides a good introduction to this topic. 
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Given a simplicial elimination ordering, Theorem 8.1.14 computes a sub- 
tree representation. When the list of maximal cliques is known, Kruskal’s 
algorithm (Theorem 2.3.3) can be used to compute a subtree representation 
without knowing a simplicial elimination ordering. 


8.1.15. Definition. A tree T is a clique tree of G if there is a bijection between 
V.(T) and the maximal cliques of G such that for each v € V(G) the cliques 
containing v induce a subtree of T. 


8.1.16. Lemma. Every tree of minimum order in which G has a subtree repre- 
sentation is a clique tree of G. 


Proof: Let T be a host tree of minimum order for a subtree representation of G. 
By Lemma 8.1.10, the vertices of a maximal clique Q in G occur at a common 
vertex q of.7. If the vertices of G assigned to some gq’ € V(T) form a proper 
subclique Q’ of Q, then the subtrees for these vertices contain the entire gq’, g- 
path in 7. The first edge of T on that path can be contracted without changing 
the intersection graph, which yields a smaller host tree. a 


The weighted intersection graph of a collection A of finite sets is a 
weighted clique in which the elements-of A are the vertices and the weight of 
each edge AA’ is |AM A’. 


1e/math 


8.1.17. Theorem. (Acharya—Las Vergnas [1982]) Let M(G) be the weighted 
intersection graph of the set of maximal cliques {Q;} of a simple graph G. 
If T is a spanning tree of M(G), thén w(T) < >~n(Q;) —n(G), with equality 
if and only T is a clique tree. 


Proof: (McKee [1993]) Let T be a spanning tree of M(G). Let 7, be the sub- 
graph of T induced by {Q;: v € Q;}. Each vertex v € V(G) contributes once to 
the weight of f for each edge of T,; hence w(T) = veviG) e(T,). Each T, is a for- 
est, so e(T,) < n(T,) — 1, with equality if and only if T, is a tree. The term n(T7,) 
contributes 1 to the size of each clique containing v. Summing the inequality 
for each vertex yields w(T) < >> n(Q;) —n(G). Equality holds if and only if each 
T, is a tree, which is true if and only if T is a clique tree. | 


M(G)_ (T solid) 
w(T) = 7 = 15-8 = )on(Q;) —n(G) 
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As a consequence of Theorem 8.1.17, we can test whether G is a chordal 
graph by finding the maximum weight of a spanning tree in M(G). Further- 
more, when G is chordal the clique trees are precisely the maximum-weight 
spanning trees of M(G) (Bernstein—Goodman [1981], Shibata [1988]; see Mc- 
Kee [1993] for related material). 


OTHER CLASSES OF PERFECT GRAPHS 


interval graphs are the intersection graphs of collections of intervals ona 
line. We proved directly in Proposition 5.1.16 that they are perfect; this also 
follows from being a subclass of the chordal graphs (Exercise 26). Interval 
graphs arise in linear scheduling problems having constraints on concurrent 
events (recall Example 5.1.15). 


8.1.18. Example. Classical applications of interval graphs. 

Analysis of DNA chains. Interval graphs were invented for the study of 
DNA. Benzer [1959] studied the linearity of the chain for higher organisms. 
Each gene is encoded as an interval, except that the relevant interval may 
contain a dozen or more irrelevant junk pieces called “introns” among the rel- 
evant pieces called “exons”. Under the hypothesis that mutations arise from 
alterations of connected segments, changes in traits of microorganisms can be 
studied to determine whether their determining amino-acid sets could inter- 
sect. This establishes a graph with traits as vertices and “common alterations” 
as edges. Under the hypotheses of linearity and contiguity, the graph is an 
interval graph, and this aids in locating genes along the DNA sequence. 


Timing of traffic lights. Given traffic streams at an intersection, a traf- 
fic engineer (or a person with common sense) can determine which pairs of 
streams may flow simultaneously. Given an “all-stop” moment in the cycle, the 
intersection graph of the green-light intervals must be an interval graph whose 
edges are a subset of the allowable pairs. These can be studied to optimize 
some criterion such as average waiting time (see Roberts [1978]). 


Archeological seriation. Given pottery samples at an archeological dig, we 
seek a time-line of what styles were used when. Assume that each style was 
used during one time interval and that two styles appearing in the same grave 
were used concurrently. Let two styles be an edge if they appear together in 
a grave. If this graph is an interval graph, then its interval representations 
are the possible time-lines. Otherwise, the information is incomplete, and the 
desired interval graph requires additional edges. u 


We present two characterizations of interval graphs. Property B in The- 
orem 8.1.20 is due to Gilmore and Hoffman [1964], and property C is due to 
Fulkerson and Gross [1965]. 


8.1.19. Definition. A 0,1-matrix has the consecutive ls property (for 
columns) if its rows can be permuted so that the 1s in each column ap- 
pear consecutively. The clique-vertex incidence matrix of G is the 
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incidence matrix with rows indexed by the maximal cliques and columns 
indexed by V(G). 


8.1.20. Theorem. The following equivaient conditions on a graph G character- 
ize the interval graphs. 
A) G has an interval representation. 
B) G is a chordal graph, and G is a comparability graph. 
C) The clique-vertex incidence matrix has the consecutive 1s property. 


Proof: We leave A > B and A © C to Exercises 26-27, proving B => C here. 
Let G be a chordal graph such that G has a transitive orientation F. We use F 
and the absence of chordless cycles in G to establish an ordering on the maxi- 
mal cliques of G that exhibits the consecutive 1s property for the clique-vertex 
incidence matrix M. 

Let Q; and Q; be maximal cliques in G. By maximality, each vertex of one 
clique has a nonneighbor in the other. Suppose that under F, some edge of G 
points from Q; to Q; and some edge of G points from Q , to Q;. If these edges 
have a common vertex, then the transitivity of F forces an edge of a clique in 
G to belong to G. Hence the situation is as on the left below, with the (dashed) 
edges of F having four distinct vertices. If the two remaining pairs among these 
four vertices form edges in G, then G has an induced C,. Hence at least one 
diagonal is in G, but each possible o orientation of it in F contradicts transitivity. 
We conclude that all the edges of G between vertex sets Q; and Q; point in the 
same direction in F. 
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QO; Qj QO; QO; O; 

We can now define a tournament T with vertices corresponding to the max- 
imal cliques of G. We put Q; > Q; in T when all edges of F between Q; and 
Q; point from Q; to Q;. By the preceding paragraph, T is an orientation of a 
complete graph; We claim that 7 is transitive. To prove this we need to show 
that Q; —~ Q; and Q; > Q; imply Q; — Q,. Suppose that x > y and w — z 
in F with x € Q;, y,w € Q;, andz € Q. If y = w, transitivity of F immedi- 
ately implies x — z. Otherwise, consider a pair xz as shown on the right above. 
Joining x and z in G would form an induced C, in G, sox # z. Hence this pair 
appears in F,, and it must be directed from x — z to avoid violating transitivity. 
We conclude that Q; > Q; in T. 

A transitive tournament specifies a unique linear ordering of the vertices 
consistent with the edges; use the transitive tournament T to order the rows of 
M as Q; > ---— Qm. Suppose that under this ordering there is some column 
x where the 1s do not appear consecutively. Then we have Q;, Q;, Q, such that 
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i<j <k,x € Q;,Q:,x ¢ Q;. Since x ¢ Q;, the clique Q; must have some 
vertex y not adjacent to x, else Q; could absorb x and would not be maximal. 
Now x € Q; implies x — y in F, and x € Q,; implies y —> x in F, which cannot 
both happen. | 


The interval graphs form a relatively small family of perfect graphs. We 
next discuss larger classes that maintain sume of the nice properties of chordal 
graphs and comparability graphs. 


8.1.21. Definition. Classes of perfect graphs (conditions on odd cycles apply 
only for length at least 5). 
o-triangulated: every odd cycle has a noncrossing pair of chords. 
parity: every odd cycle has a crossing pair of chords. 
Meyniel: every odd cycle has at least two chords. 
weakly chordal: no induced cycle of length at least 5 in G or G. 
strongly perfect: every induced subgraph has a stable set meeting all its 
maximal cliques. 


Gallai [1962] proved that o-triangulated graphs are perfect. Every chordal 
graph is o-triangulated (Exercise 34) and weakly chordal (Exercise 40). All o- 
triangulated and parity yraphs are Meyniel graphs. Meyniel graphs are perfect 
(Meyniel [1976], Lovasz [1983]) and also strongly perfect (Ravindra [1982]). 

Parity graphs, shown to be perfect:in Olaru [1969] and Sachs [1970], carry 
that name due to a later characterization by Burlet and Uhry [1984]: G is a 
parity graph if and only if, for every pair x,y € V(G), the chordless x, y-paths 
are all even or all odd (Exercise 36). ~ 


8.1.22. Example. The graphs below exhibit differences amung these classes. 
Here T, C, O, P, M, W respectively denote the classes of chordal (Triangulated), 
comparability, o-triangulated, parity, Meyniel, and weakly chordal graphs. Sm 


Vw 


(WOC)-M 


SA 
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Strongly perfect graphs were introduced by Berge and Duchet [1984]. 
Changing maximal to maximum in the definition yields a weaker requirement 
equivalent to y-perfection; a stable set meeting all maximum cliques can be 
used as the first color class in an w(G)-coloring constructed inductively. Thus 
strongly perfect graphs are perfect. 

The class of strongly perfect graphs does not contain all Meyniel graphs 
or all weakly chordal graphs (Exercises 39-40), but it does contain all chordal 
graphs and all comparability graphs. (As observed in Proposition 5.3.25, when 
G has a transitive orientation, each induced subgraph inherits a transitive 
orientation, and the vertices with indegree 0 in this orientation form a stable 
set that meets all the maximal cliques.) 

Our next class is a subclass of the strongly perfect graphs (Exercises 37—38) 
that still contains all chordal graphs and comparability graphs. Introduced by 
Chvatal [1984], it has played an important role in the theory of perfect graphs. 


8.1.23. Definition. A perfect order on a graph is a vertex ordering such 
that greedy coloring with respect to the ordering inherited by each induced 
subgraph produces an optimal coloring of that subgraph. A perfectly or- 
derable graph is a graph having a perfect order. 

In an orientation of G, an obstruction is an induced 4-vertex path 
a,b, c,d whose first and last edges are oriented toward the leaves. ‘The ori- 
entation of G associated with a vertex ordering L orients each edge toward 
the vertex earlier in L: u < v ifu-< v. A vertex ordering is obstruction- 
free if its associated orientation has no obstruction. 


> 
tdlegr 
i>) 


a d 


The orientation associated with a perfect order is obstruction-free, because 
on an obstruction the greedy coloring would use three colors instead of two. 
Chvatal proved that a graph is perfectly orderable if and only if it has an 
obstruction-free ordering. The characterization implies that perfectly order- 
able graphs are perfect and that chordal graphs and comparability graphs are 
perfectly orderable. 


8.1.24. Example. Chordal graphs and comparability graphs are perfectly or- 
derable. The orientation of a chordal graph associated with a simplicial con- 
struction ordering has no induced u < v — w- A transitive orientation of a 
comparability graph has no induced u > v > w. 

Every orientation with an obstruction has both an induced u > v > w 
and an induced u < v > w. Hence if G is a comparability graph or a chordal 
graph, then G has an obstruction-free ordering. By Chvatal’s characterization, 
such graphs are perfectly orderable. | 


8.1.25. Lemma. (ChvAtal [1984]) Let G have a clique Q and a stable set S that 
are disjoint, and suppose that each w € Q is adjacent to some p(w) € S. If 
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L is an obstruction-free ordering of G such that p(w) < w for all w € Q, 
then some p(w) € S is adjacent to all of Q. 


Proof: We use induction on n(G). For the basis step n(G) = 1, there is nothing 
to prove. Consider n(G) > 1. For each w € Q, the graph G — w satisfies the 
hypotheses using the clique Q — w and the stable set {p(u) :u € O—w}. By the 
induction hypothesis, there is a vertex w* € Q — w such that p(w*) o Q —w. 
We obtain w € Q such that p(w*) = Q@ unless p(w*) # w for all w € Q. This 
assigns a unique w”* to every w, since p(w*) is nonadjacent only to w among 
Q. Mapping w to w* thus defines a permutation on Q. Since p(w) — w, the 
permutation has no fixed point. 

We seek an obstruction in the orientation associated with L. Let v be the 
least vertex of Q in L. Let b,c € Q be the vertices such that b* = v andc* = b 
(possibly c = v). Let a = p(b) and d = p(v). Because p(w*) # w, we have 
a #candd # b, which implies a ¥ d in the stable set S and yields the picture 
below for the orientation associated with L. 


1.me/fhath_ bdaks 


a= p(b) 


Because d = p(b*), the only vertex of Q nonadjacent to d is b; thus c 
d. Since d = p(v) < v < cin L, wethave d < c. Nowa,b,c,d induce an 
obstruction, which contradicts the hypothesis for L. Hence p(w*) < w for some 
w, and p(w”) is the desired vertex of S. | 


8.1.26. Theorem. (Chyatal [1984]) A vertex ordering of a simple graph G isa 
perfect order if and only if it is obstruction-free, and every graph with such 
an ordering is perfect. 


Proof: We have observed that the condition is necessary. Since the class of 
graphs with obstruction-free orderings is hereditary (the inherited ordering for 
an induced subgraph is also obstruction-free), it suffices to show that the greedy 
coloring of G relative to an obstruction-free ordering / is optimal. Let k be the 
number of colors used by the greedy coloring relauve to L. To prove optimality, 
we show that G has a k-clique; this alse inductively proves perfection. 

Let f: V(G) — [k] be the resulting coloring. Let i be the least integer such 
that G has a clique wj41...., w;, such that f(w;) = j. Since f uses color k on 
some vertex, i is well-defined. If i = 0, then G has a k-clique. 

Ifi > 0, then for each w; there is-a vertex p(w;) such that p(w;) < w; in L 
and f(p(w;)) =i; otherwise the greedy coloring would use a lewer color on w;. 
Let S = {p(w;41)...., p(w,x)}. Since all of S has the same color, S 1s a stable set. 
Hence the conditions of Lemma 8.1.25 are satisfied, and some vertex of S can 
be added to the clique to become w;. This contradicts the tninimaility of /. 
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Next we consider a different way of generating perfect graphs. An oper- 
ation that preserves perfection can enlarge a class of perfect graphs. Vertex 
multiplication, which expands each vertex into an independent set, is such 
a property. We generalize this. If V(G) = {v1,...,v,}, and Hy,..., H, are 
pairwise disjoint graphs, then the composition G[H;,...,H,] is the graph 
Hy +---+H, together with (xv: x € V(Hj), vy € V(H)), v;v; € E(G)}. The special 
case G[Ki1,..., Kn] is Goh. The example below uses H; = 2K1, Ho = Ko+K1, 
H3 = P3, Hy = Ko, and G = K 13 with central vertex v1. 


v1 


Lovasz proved that composition preserves perfection. This is one corollary 
of Chvatal’s Star-Cutset Lemma. 


8.1.27. Definition. A star-cutset of G is a vertex cut S containing a vertex x 
adjacent to all of S — {x}. A minimal imperfect graph is an imperfect 
graph whose proper induced subgraphs are all perfect. 


1 


1e/m 


8.1.28. Lemma. (The Star-Cutset Lemma Lemma) If G has no stable set inter- 
secting every maximum clique, and every proper induced subgraph of G is 
w(G)-colorable, then G has no star-cutset. 


Proof: Suppose that G has a star-cutset C, with w adjacent to all of C — {w}. 
Since G —C is disconneg¢ted, we can partition V(G — C) into sets V;. V2 with no 
edge between them. Let G; = G[V; UC], and let f; be a proper w(G)-coloring 
of G;. Let S; be the set of vertices in G; with the same color in f; as w; this 
includes w but no other vertex of C. Since there are no edges between V; and 
Vo, the union S = S; U Sz is a stable set. 

If Q is a clique in G — S, then Q is contained in G, — S, or in G2 — Sg. Since 
fi provides an w(G) — 1-coloring of G; — S;, we have |Q| < w(G) — 1. Since this 
applies to every clique Q in G — S, the stable set S meets every (G)-clique of 
G, which contradicts the hypotheses. | 


Cc 


8.1.29. Theorem. (The Star-Cutset Lemma, Chvatal [1985b]) No minimal im- 
perfect graph has a star-cutset. 
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Proof: If G is a minimal imperfect graph, then x(G) > w(G) and deletion of 
any stable set S leaves a perfect graph. Hence we have 


1+ @(G) < x(G) <1+x(G—-S)=1+(G—S) <1+(G). 


This yields w(G — S) = w(G), which means that no stable set meets every 
maximum clique. Furthermore, since G is minimally imperfect, every proper 
induced subgraph G’ satisfies x (G’) = w(G’) < w(G), making it w(G)-colorable. 
Lemma 8.1.28 now implies that G has no star-cutset. | 


The Replacement Lemma generalizes Lemma 8.1.4. 


8.1.30. Corollary. (Replacement Lemma—Lovasz [1972b]) Every composition 
of perfect graphs is perfect. 


Proof: A composition can be constructed by a sequence of substitutions in which 
a single vertex v of G; is replaced by a graph G2 and all edges added between 
V(G2g) and U = Ng,(v) to form a graph G. Hence it suffices to prove that this 
operation preserves perfection. If the resulting graph G is not perfect, then it 
contains a minimal imperfect induced subgraph F. Such a subgraph cannot be 
contained in G, or G2, which forces it te have at least two vertices of Go and at 
least one vertex of G,. 

If F has no vertex of G,; outside u, then F = F[U] V (F 1 Gg). The join 
operation preserves perfection, since xtH V H’)= x(H#)+ x(A’) and w(Hv HA’) = 
w(H) + w(H’) for all H, H’. Hence we-may assume that F has a vertex of G; 
outside U. In this case, V(F)NU together with one vertex of G2 in F isa 
star-cutset of F. Hence the replacement of v with G2 introduces no minimal 
imperfect subgraph F. a 


The Star-Cutset Lemma also yields perfection of weakly chordal graphs. 
Hayward [1985] proved that G or G has a star-cutset when G is a weakly 
chordal graph that is not a clique or stable set. With the Star-Cutset Lemma 
and the Perfect Graph Theorem, this implies that no weakly chordal graph is 
a minimal imperfect graph. Since the class is hereditary, it follows that every 
weakly chordal graph is perfect. 


IMPERFECT GRAPHS 


The p-critical graphs are the minimal imperfect graphs. The Strong Per- 
fect Graph Conjecture (SPGC) states that the only p-critical graphs are the odd 
cycles (of length at least 5) and their complements. With eneugh properties of 
p-critical graphs, perhaps we could prove that only odd cycles and their com- 
plements have all these properties; this would prove the SPGC. We begin with 
simple observations about p-critical graphs, some used earlier in discussing 
star-cutsets. (This presentation was originally modeled after Shmoys [1981].) 
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8.1.31. Lemma. If G is p-critical, then G is connected, G is p-critical, w(G) > 
2, and a(G) > 2. Furthermore, for every x € V(G), x(G — x) = w(G) and 
6(G —x) =a(G). 


Proof: G is perfect if and only if every component of G is perfect, and G is 
perfect if and only if.G is perfect. Cliques and their complemeuts are perfect. 
Finally, we observed in proving Theorem 8.1.29 that deleting a stable set from 
a p-critical graph cannot decrease the clique number. Since G — x is perfect, we 
thus have x(G — x) = w(G — x) = w(G). The condition 6(G — x) = a(G) is this 
statement for G. rT 


More subtle properties of p-critical graphs follow from Lovasz’s extension 
of the PGT. 


8.1.32. Theorem. (Lovasz [1972b]) A graph G is perfect if and only if 
w(G[A])a(G[A]) > |A| for all A C V(G). | 


The property “w(G[Al)a(G[A]) > |A| for all A C V(G)” was suggested 
by Fulkerson; we call it 6-perfection. It is implied by a-perfection or y- 
perfection; if we can color G with w(G) stable sets, then some stable set has at 
least n(G)/w(G) vertices. The converse involves counting arguments like those 
we gave for the PGT, but more delicate: Since £-perfection is unchanged under 
complementation, Theorem 8.1.32 immediately implies the PGT. 


At 


8.1.33. Theorem. If G is p-critical, then n(G) = a(G)w(G) + 1. Furthermore, 
for every x <¢ V(G), G — x has a partition into w(G) stable sets of size a(G) 
and a partition into a(G) cliques of size w(G). 


nN 


Proof: When G is p-critical, the condition for 8 -perfection fails only for the full 
vertex set A = V(G). Hence for each x € V(G) we have 


n(G) —1<a(G —x)w(G— x) =a(G)w(G) < n(G) —- 1. 


Therefore, n(G) = a(G)w(G) + 1. Since x(G — x) = w(G — x) = w(G), we can 
cover G — x by w(G) stable sets. Having size at most a(G), these sets partition 
the a(G)w(G) vertices of G — x into w(G) stable sets of size a(G). Similarly, 
0(G — x) = a(G — x) = a(G) yields a partition of V(G — x) into a(G) cliques of 
size w(G). a 


Study of p-critical graphs has benefitted by enlarging the class to include 
other graphs satisfying the properties in Theorem 8.1.33. Structural proper- 
ties of the larger class are useful when proving the SPGC for special classes of 
graphs. Padberg [1974] began the study of these graphs. Several definitions 
were suggested to extend the class of p-critical graphs but turned out to be al- 
ternative characterizations of the same class. The definition we use originates 
in Bland—Huang-Trotter [1979]. 


8.1.34. Definition. For integers a, w > 2, a graph G is a, w-partitionable if 
it has aw + 1 vertices and for each x € V(G) the subgraph G — x has a 
partition into a cliques of size w and a partition into w stable sets of size a. 
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8.1.35. Theorem. (Buckingham—Golumbic [1983]) A graph G of order aw + 1 
is a, w-partitionable if and only if x(G — x) = w and 6(G — x) = a for every 
x € V(G). Furthermore, w(G) = w and a(G) = a for such graphs, and the 
inequalities ¥(G — x) < w and @(G — x) < a suffice. 


Proof: Let G be partitionable. Since G — x is w-colorable and has a w-clique, 
x(G —x) = w= a@(G — x). Since a > 2, G is not a complete graph. Deleting a 
vertex x outside a maximum clique @Q in G yields w(G) = w(G — x) = w. The 
same arguments for G yield the results for a. 

Conversely, suppose that x(G — x) < w and 6(G — x) < a for every x in 
V(G). The latter inequality yields a(H) < a. Hence an optimal coloring of 
G — x uses at most w stable sets of size at most a. Since n(G — x) = aw, such 
a coloring partitions V(G — x) into w stable sets of size a. Similarly, a covering 
of G — x by a cliques yields the desired clique partition. a 


By Theorem 8.1.33 and Theorem 8.1.35, every p-critical graph is parti- 
tionable and every partitionable graph is imperfect. Furthermore, G is a, w- 
partitionable if and only if G is w, a-partitionable. 


8.1.36. Example. Cycle-powers. The graph C“ is constructed by placing n 
vertices on a circle and making each:vertex adjacent to the d nearest ver- 
tices in each direction on the circle.=« When d = 1, CF = On We view 
the vertices as the integers modulo #, in order. The graph Ci shown on 
the left below, is neither perfect nor p-critical (the vertices 0, 2, 4,6, 8 induce 
C5), but C? 10 is 3,3-partitionable. When i is removed, the unique partition 
of the remaining nine vertices into three triangles is {i + 1,i + 2,i + 3), 
G+4,i+5,1+6) @+7,14+8 i+ 9)}, and the unique partition into three 
stable sets is {(i +1, i+4,i+7),@4+2,1+5,i14+8),@4+3,1+6,i+9)}. 
Always Ce +1 18 4, w-partitionable. Every w consecutive vertices in G — x 
form a clique, and every a vertices spaced by jumps of length w form a stable 
set. Showing that Cy i is p-critical if and only if w = 2 or a = 2 reduces the 


SPGC to the statement that G is p-critical if and only if G = C“0) @yat- a 


8.1.37. Example. Other partitionable graphs. Other partitionable graphs arise 
by adding unimportant edges to Ci’, ice In Cay we can add any diagonal with- 
out changing the set of maximum cliques or the set of maximum stable sets, so 
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the resulting graph is still partitionable. We will see that the SPGC would fol- 
low if all partitionable graphs came from cycle-powers by adding unimportant 
edges of this type. 

Nevertheless, there are other partitionable graphs, such as the graph in 
the middle above (Chvatal-Graham—Perold—Whitesides [1979], Huang [1976]). 
Every edge in this graph belongs to a maximum clique, but it has two more 
edges than C2). The partitions demonstrating that it is partitionable differ 
from those used for C2, (Exercise 42). | 


8.1.38. Example. Further properties of Cy, cE The graph Cy er] has exactly n 
maximum cliques, each using w consecutive vertices on the cycle. Each vertex 
lies in w consecutive w-cliques. There are also exactly n maximum stable sets, 
each having a — 1 gaps of length w and one gap of length w + 1 between suc- 
cessive vertices. A maximum stable set containing x has a places for the larger 
gap, so each vertex x lies in a maximum stable sets. 

Finally, a w-clique can avoid a maximum stable set only by fitting inside 
the gap of length w + 1 (shown above Example 8.1.37 on the right). Thus there 
is a pairing {(Q;, 5;)} between the maximum stable sets and maximum cliques 
such that Q; 1 S; = @ if and only if i = j. a 


These “further properties” comprise the next characterization. The argu- 
ments are due to Padberg [1974], who-used them in a polyhedral characteriza- 
tion of perfect graphs. Here combinatorial conclusions follow from properties 
of matrices in linear algebra. Other characterizations of partitionable graphs 
appeared in Bland-Huang-Trotter [1979], Golumbic [1980, p58-62], Tucker 
[1977], Chvdtal—Graham-—Perold—Whitesides [1979], and Buckingham [1980]. 


8.1.39. Theorem. A graph G of order n = aw + 1 is a, w-partitionable if and 
only if both conditions below hold: 
1) a(G) =a and w(G) = w, and each vertex of G belongs to exactly w 
cliques of size w and a stable sets of size a. 
2) G has exactly n maximum cliques {Q;} and exactly n maximum 
stable sets {S;}, with Q; S; = @ if and only ifi = j (Q; and S; are mates). 


Proof: Necessity. We have proved x(G — x) = w = w(G) and 0(G— x) =a = 
a(G) for each x € V(G). Choose a clique @Q of size w. For each x € Q, G—x has 
a partition. into a cliques of size w. Together, Q and these w partitions form a 
list of n = aw +1 maximum cliques Q),..., Q,. Each vertex outside Q appears 
in one clique in each partition. Each vertex in Q appears in Q and once in w —1 
partitions. Hence every vertex appears in exactly w cliques in the list. 

For each Q;, we obtain a maximum stable set 5; disjoint from Q;. Choose 
x € Q;. The w maximum stable sets that partition V(G — x) can meet Q; only 
at the w — 1 vertices other than x. Therefore, one of these stable sets misses Q;; 
call it S;. We will show that these two lists contain all the cliques and stable 
sets and have the desired intersection properties. 

Let A be the incidence matrix with a;,; = 1ifx; € Q; and a;,; = 0 otherwise. 
Let B be the matrix with b; ; = 1 if x; € S; and b; ; = 0 otherwise. The ijth 
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entry of AB’ is the dot product of row i of A with row j of B, which equals 
|Q;S;|. By proving that AB’ = J — 1, where J is the matrix of all 1s, we 
obtain Q; NS; 4 Oif and only ifi 4 j. Since J — / is nonsingular, this will also 
imply that A and B are nonsingular. Nonsingular matrices have distinct rows, 
and hence Q;,..., Q, and S;,..., 5, will be distinct. 

By construction, |Q; 1 S;| = 0. Since cliques and stable sets intersect at 
most once, to prove that AB’ = J — J we need only show that each column of 
AB? sums ton — 1. Multiplying by the row vector 17 on the left computes these 
sums. We constructed A so that each column has w 1s (because each vertex 
appears in w cliques in the list) and B so that each row has a 1s (because each 
stable set has size a). Therefore, 


17(AB’) = (17 A)B? = w17B? = wal, =(n-1)17. 


To prove that G has no other maximum cliques, we let g be the incidence 
vector of a maximum clique Q and show that g must be a row of A. Since A 
is nonsingular, its rows span R", and we can write g as a linear combination: 
q = tA. To solve for t, we need A~!. Since every row of A sums to w, we have 
A(w J — BT) = w-!wJ — (J — 1) =I, and hence A~! = w~1J — B’. Thus, 


t=qA+=q(w lJ — B') =o 1qJ —qB" =o 'wl! — qB’. 


The ith column of B7 is the incidence-vector of S;; hence coordinate i of qB™ 
equals |Q 1 S;|, which is 0 or 1. Hence+ is a 0,1-vector and-g is a sum of rows 
of A. Since g sums to w, only one row can be used. Thus g is a row of A and 
Q1,...,Q, are the only maximum cliques. 

The same argument applied to G shows that G has exactly n maximum 
stable sets, with each vertex appearing in a of them. 


Sufficiency. By Theorem 8.1.35, we need only prove that x(G — x) < w and 
6(G — x) <a for all x € V(G). Given the cliques and stable sets as guaranteed 
by condition (2), define the incidence matrices A, B as above. By condition 
(1), each column of B has a 1s, and hence JB = aJ = BJ. The intersection 
requirements in condition (2) yield AB’ = J — J. This is nonsingular, so B is 
nonsingular and 


A’B=B BATB=B (J -NB=B'BJ-Il=J-1. 


In the product A’ B = J — J, the row corresponding to x € V(G) states that 
V(G —x) is covered by the mates of the w maximum cliques containing x (illus- 
trated below), and hence x(G — x) < w. Similarly, the column corresponding 
to x states that V(G — x) is covered by the mates of the a maximum stable sets 
containing x, and hence 6(G — x) <a. a 


Section 8.1: Perfect Graphs 339 


8.1.40. Corollary. If G is a, w-partitionable and w = 2, then G = Co,41; if 
a = 2, then G = Co,,.41. Hence the SPGC reduces to showing that p-critical 
graphs have w = 2 ora = 2. 


Proof: If w = 2, then every vertex belongs to exactly two cliques of size 2, so G 
is 2-regular. Furthermore, G is connected and has odd order (2a + 1), so G is 
an odd cycle. For « = 2, consider G. a 


Henceforth we use w’, w(G), w interchangeably and a, a(G), a interchange- 
ably for partitionable graphs. 


8.1.41. Theorem. (Tucker [1977]) Let + be a vertex in a partitionable graph 
G. The subgraph G — x. has a unique minimum coloring; denoted X(G — x), 
it consists of the mates of the maximum cliques containing x. Similarly, 
G — x has a unique minimum clique covering X(G — x) consisting of the 
mates of the maximum stable sets containing +. 


Proof: Since G is a. w-partitionable, G — x is w-colorable using w stable sets 
of size a. Every w-clique containing . misses some color class, since the clique 
has only w — 1 vertices in G — x. Thus all w-cliques containing + have mates 
as color classes in the coloring. There are exactly w of these, so the coloring is 
unique. The other statement follows by complementation. a 


8.1.42. Theorem. (Buckingham—Golumbic [1983]) If x is a vertex of an a, w- 
partitionable graph G, then 2w — 2 < d(x) <n —2a+4+1. 


Proof: Select a vertex v # + (see illustration above). Let S be the stable set in 
X(G — v) that contains x, and let S’ be-another stable set in X(G — v). Choose 
z € N(x) MN Sg. In @(G — =), some clique Q contains x. Since v # x, Q has one 
vertex in each stable set of X(G — v), including S’. Since Q € ©(G — z) implies 
z ¢ Q, this yields a second neighbor of x in S’. Thus x has at least two neighbors 
in each of the w — 1 stable sets in X(G — v), yielding d(x) > 2m — 2. The same 
argument in G yields n — 1 — d(x) = |Ng(x)| > 2a - 2. | 


These bounds on vertex degrees in a, w-partitionable graphs are sharp, as 
they hold with equality for powers of cycles. 


8.1.43. Definition. An edge of a graph is critical if deleting it increases the in- 
dependence number. A pair of nonadjacent vertices is co-critical if adding 
it increases the clique number. 
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The characterization of critical edges in partitionable graphs is implicit in 
the work of several authors. 


8.1.44. Theorem. For an edge xy in a partitionable graph G, the following 
statements are equivalent. 
A) xy is a critical edge. 
B) SU {x} € X(G — y). 
C) xy belongs to w — 1 maximum cliques. 


Proof: B => A. SU {x, y} is a stable set of size a + 1 in G — xy. 

A= C. If xy is critical, then there is a set S such that SU {x} and SU{y} are 
maximum stable sets in G. Hence every maximum clique containing x but not 
y is disjoint from S U {y}. Since there are w maximum cliques containing x and 
only one maximum clique disjoint from S U {y}, the remaining w — 1 maximum 
cliques containing x must also contain y. 

C = B. The stable sets in the unique coloring of G — x are the mates of the 
cliques containing x. Since xy belongs to w — 1 maximum cliques, the mates of 
these w — 1 cliques belong to both X(G — x) and X(G — y). This leaves only a+1 
vertices in the graph, consisting of the vertices x, y and a stable set § such that 
SU{y} €X(G — x) and SU {x} e X(G — y). os 


8.1.45. Corollary. Let G be a partitionable graph. If xy is an edge appearing 
in no maximum clique, then G — xy-is partitionable. Ifx, y is a nonadjacent 
pair appearing in no maximum stable set, then G + xy is partitionable. 


Proof: By complementation, we need. only prove the first statement. If we 
delete an edge appearing in no maximum clique, then by Theorem 8.1.44 it 
is not a critical edge, and we have w(G — xy) = w(G) and a(G — xy) = a(G). 
Since we have not destroyed any maximum clique and have not created a bigger 
stable set, we can use the optimal coloring and clique partition of G — u to 
conclude that x(G — xy — u) < w and 6(G — xy —u) < a. Hence G — xy is 
partitionable, by Theorem 8.1.35. a 


The discussion in Example 8.1.37 suggests that edges appearing in no max- 
imum clique are uninteresting “junk”. Corollary 8.1.45 assures us that “junk 
is junk”. The partitionable cycle-powers have no junk. 


THE STRONG PERFECT GRAPH CONJECTURE 


We have been proving properties of partitionable graphs.in a “top down” 
approach to the SPGC, trying to find enough properties to eliminate all but odd 
cycles and their complements as p-critical graphs. The “bottom up” approach 
is to verify that the SPGC holds on larger and larger classes of graphs, until all 
are included. 


8.1.46. Definition. An odd hole or odd antihole in G is an induced subgraph 
of G that is Co,.41 or Co.41 (for some k > 2), respectively. A graph having 
no odd hole or antihole is a Berge graph. 


Section 8.1: Perfect Graphs 341 


One way to prove that a class G satisfies the SPGC is to prove that every 
Berge graph in G is perfect. A hereditary class G satisfies the SPGC if the odd 
cycles and their complements are the only p-critical graphs in G. 

The SPGC holds for planar graphs (Tucker [1973]), toroidal graphs (Grin- 
stead [1981]), graphs with A(G) < 6 (Grinstead [1978]) or w(G) < 3 (Tucker 
[1977]), and for various classes defined by forbidding fixed small induced sub- 
graphs (Meyniel [1976], Tucker [1977], Parthasarathy—Ravindra [1976, 1979], 
Chvatal—Sbihi [1988], Olariu [1989], Sun [1991]). We consider three families. 


8.1.47. Definition. A circular-arc graph is the intersection graph of a family 
of arcs of a circle. A circle graph is the intersection graph of a family of 
chords of a circle. A K,3-free graph is a graph not having K;3 as an 
induced subgraph. 


Every cycle is both a circle graph and a circular-arc graph, but neither of 
these classes contains the other (Exercise 47). 

One way to prove the SPGC for a class G is to show that every partitionable 
graph in G belongs to another class H where the SPGC is known to hold. In 
this role we use the class {C“}. 

8.1.48. Theorem. (Chvatal [1976]) Cyele-powers satisfy the SPGC. In particu- 
lar, the graph C""~, is p-critical if and only if w = 2 or a = 2, in which case 


the graph is an odd hole or antihole. 


Proof: It suffices to consider the partitionable graph G = Cy, ve This is p- 
critical when a = 2 or w = 2, so we may assume a,w > 2. Let the vertices 
be {vg...., Vaw}, and let S = (vjy41,UG41e: O < i < a—1}. The subgraph G[S] 
is a cycle, since the indices of consecutive vertices in S are separated by 1 or 
w — 1 (except that v,,, and v; are separated by 2), and indices of nonconsecutive 
vertices differ by at least w. To obtain Cg,_1 as a proper induced subgraph, we 
replace {U(g—1w+ Vaws V1. Vw} With (v¢g—1)w41. Vo. V»—1} In S. We conclude that G is 
not p-critical. | 


8.1.49. Theorem. (Tucker [1975]) The SPGC holds for circular-arc graphs. 


Proof: Recall that N[v] denotes N(v) U {v}, the closed neighborhood of v (Def- 
inition 3.1.29). When G is partitionable with distinct vertices x, y, we claim 
that N[x] Z Nl[y]. Consider the clique Q containing x in ©(G — y); we have 
OC N[x]. If N[y] contains N[x], then Q U {y} is a clique of size w(G) + 1. 
Now, if G is a partitionable circular-arc graph, it suffices to show that G = 
C*)-1 because the SPGC holds for cycle-powers (Theorem 8.1.48). Consider 
a circular-arc representation that assigns arc A, to x € V. Since N[y] cannot 
contain N[x], the arc A, cannot lie within another arc A, of the representation. 
If no arc contains another, then every arc that intersects A, contains exactly 
one of its endpoints. Since the vertices corresponding to the arcs containing 
one point induce a clique, there are at most w — 1 other arcs containing each 
endpoint of A,. Equality holds, since Theorem 8.1.42 requires 5(G) > 2w — 2. 
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Starting from a given point p an the circle, let v; be the vertex represented 
by the ith arc encountered moving clockwise from p. Since each arc meets 
exactly w — 1 others at each endpoint, v; is adjacent to vj,1..... Vit-w-1 (addition 
modulo n) for each i. Hence G = C?"?. 7 


The original proof of the SPGC for claw-free graphs (Parthasarathy— 
Ravindra [1976]) was quite intricate. Further study of p-critical graphs has 
shortened both it and the proof of the next theorem, which we will apply. 


8.1.50. Theorem. (Giles—Trotter—Tucker [1984]) If a partitionable graph G 
has a cycle consisting of critical edges, then the subgraph G’ obtained by 
deleting the edges belonging to no maximum clique is C°~! 


Proof: (Hartman [1995]) Suppose that G is a, w-partitionable. Deleting edges 
destroys no stable set. Deleting edges in no maximum clique destroys no 
maximum clique. Hence the coloring and clique covering of G — x also yield 
x(G' —x) < w and 6(G’ — x) < a (regardless of whether a(G’) > a(G)). By The- 
orem 8.1.35, G’ is thus a, w-partitionable. Also, the clique coverings of G’ — x 
for various x force G’ to be connected. 

We next prove that if G has a u, v-path consisting of k critical edges, then 
u and v belong to at least w — k common maximum cliques. We use induction 
on k, with Theorem 8.1.44 providing the basis step, k= 1. For k > 1, if y is the 
vertex before v on such a path, then the induction hypothesis puts u and y in 
w — k + 1 common maximum cliques. Since y belongs to exactly w maximum 
cliques (by Theorem 8.1.39), and w — Lof these contain v (by Theorem 8.1.44), 
at most one of the w — k + 1 cliques containing u and y can omit v. 

Let C be a cycle of critical edges in G. Critical edges belong to maximum 
cliques, so C remains in G’. As shown above, w vertices forming a path in 
G’ induce a maximum clique in G’. If the length of C exceeds w, then this 
establishes w successive maximum cliques containing a given vertex x of C. By 
Theorem 8.1.39, these are all the maximum cliques of G containing x, and hence 
they include all the edges of G’ incident to x. Hence C is a component of G’, but 
G’ is connected, so C contains all vertices of G’. This expresses G’ as C°"1. 

If the length of C is at most w, then V(C) itselfis a clique. Ifx ¢ V(C), then 
the vertices of C —x belong to distinct stable sets in the coloring X(G —.x) defined 
by Theorem 8.1.41. Let xo, ..., x, be the vertices of C in order. Let S,,..., 5; be 
the stable sets in G — V(C) such that S; U {x;} € X(G — xo). Because x;x;,1 is a 
critical edge, x; and x;,; belong to w — 1 common maximum cliques (Theorem 
8.1.44), and hence by Theorem 8.1.41 the colorings X(G — x;) and X(G — x;+1) 
have w — 1 common stable sets. The remaining set differs only in having x; or 
Xi41- Hence X(G — x1) contains S; U {x;} for i > 2, and it also contains S, U {xo}. 

Continuing these substitutions while following the edges of C, we find that 
X(G —x,) contains S; U{x;_1} for 1 <i <k. Taking one more step to return to xo, 
we find that X(G — x9) contains S; U {x;_1} for 2 < i < k and contains Sj U {x;}. 
Since k > 2anda > 2, these sets are different from our initial sets in X(G — x9). 
Since the coloring X(G — xq) is unique, we have obtained a contradiction, and 
the case n(C) < w does not arise. a 
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8.1.51. Theorem. (Chvatal [1976]) If G is a p-critical graph such that the 
spanning subgraph G’ obtained by deleting the edges of G belonging to no 
maximum clique is a cycle-power C4, then G is an odd hole or odd antihole 
(and equals G’). 


Proof: A p-critical graph is partitionable. The stable sets and maximum cliques 
in G are stable sets and cliques in G’, and by Theorem 8.1.35 we again conclude 
that G’ is partitionable with a(G’) = a(G) = a and w(G’) = w(G) = w. Hence 
G’ = C¥~|.. We index the vertices so that the maximum cliques of G’ (and G) 
consist of w cyclically consecutive vertices, and the maximum stable sets have 
the form v;, vj, y,---.; Vitaw- In particular, vertices separated by a multiple of w 
on the cycle vg, ..., vay are nonadjacent in G’ and in the full graph G. 

If G’ = G, then Theorem 8.1.48 implies that G is an odd hole or odd anti- 
hole. If G’ # G, then a, w > 2, since otherwise deleting an edge increases the 
number of maximum stable sets or decreases the number of maximum cliques. 

For a, w > 3, we exhibit an imperfect proper induced subgraph H of G (the 
induced odd cycle in G’ obtained in Theorem 8.1.48 may have a chord in G). Let 
S = (vaw, V1, Uw, Vw 42} U{Vin41: 2<i< a—l}, and let T = {U(a—1)w+1; Vaws V1, Vyw}U 
{Vpij: 2 <i < w-—1}. Thesets S and T have sizes a+2 and w+2, and fora, w > 
3 they share exactly the five vertices {v(q—1)w+, Vaw, V1, Vw, Vw42}. Furthermore, 
S intersects every maximum clique of:G’ (and hence of G), and T intersects 
every maximum stable set of G’ (and hence of G) (Exercise 49). Letting H = 
G — (SUT), this yields a(H) = a — l-and w(H) = w — 1. Now imperfection 
follows from d 


n.me/1 


4-5) > (a—-1)w-D. | 


n(H) > n(G) —-(a+wt 
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8.1.52. Corollary. (Giles—Trotter—Tucker [1984]) If G is a p-critical graph and 
for each v € V(G) the minimum coloring X(G — v) has (at least) two sets 
that.each contain exactly one neighbor of v, then G is an odd hole or an odd 
antihole. 


Proof: When some set in X(G — v) has exactly one neighbor u of v, the edge uv 
is critical. Hence the hypothesis implies that the subgraph of critical edges has 
minimum degree at least 2 and therefore contains a cycle. By Theorem 8.1.50, 
the subgraph G’ obtained by deleting the edges belonging to no maximum clique 
is C’-!. By Theorem 8.1.51, G is an odd hole or an odd antihole. a 


8.1.53. Corollary. (Parthasarathy—Ravindra [1976]) The SPGC holds for K;,3- 
free graphs. 


Proof: (Giles—Trotter-Tucker [1984]) Let G be a p-critical K,,3-free graph. For 
each v € V(G), N(v) induces a perfect subgraph having no stable set of size 
3. This means that N(v) can be covered by two cliques, which implies d(v) < 
2w(G) — 2. Each of the w(G) stable sets in X(G — v) contains a neighbor of v, 
else adding v creates a larger stable set. With d(v) < 2w(G) — 2, at least two of 
these sets have exactly one neighbor of v. Hence G satisfies the hypothesis of 
Corollary 8.1.52, and G is an odd hole or antihole. | 
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Corollary 8.1.53 also yields the SPGC for circle graphs (Exercise 50). The 
general SPGC remains open, but a result intermediate between it and the PGT 
is known (it is immediately implied by the SPGC and immediately implies the 
PGT). Chvatal conjectured that if G and H have the same vertex set and have 
the same 4-tuples of vertices that induce P,, then G is perfect if and only if H 
is perfect. Reed [1987] proved this “Semi-Strong Perfect Graph Theorem”. 


EXERCISES 


8.1.1. (—) Compute x(G) and w(G) for the complement of the odd cycle Co;.1. 
8.1.2. (—) Determine the smallest imperfect graph G such that x(G) = w(G). 


8.1.3. (!)P4-free graphs are also called cographs, which stands for “complement re- 
ducible”. A graph is complement reducible if it can be reduced to an empty graph by 
successively taking complements within components. 

a) Prove that a graph G is P,-free if and only if it is complement reducible. 

b) Use part (a) and the Perfect Graph Theorem to prove that every P,-free graph is 
perfect. (Seinsche [1974]) 


8.1.4. Clique identification. Suppose that G = G,UGo, that G; NG; is aclique, and that 
G, and Gz are perfect. Without using the Star-cutset Lemma, prove that G is perfect. 


8.1.5. Find an imperfect graph G having.a star-cutset C such that the C-lobes of G 
are perfect graphs. (Comment: Thus identification at star-cutsets does not preserve 
perfection, although no p-critical graph has a star-cutset.) 


8.1.6. Let G be a cartesian product of complete graphs. Prove that a(G) = 6(G). Prove 
that Ko 0 K20 K3 is not perfect. 


8.1.7. Prove that C; Vv K, is the only color-critical 4-chromatic graph with six vertices. 


8.1.8. (+) Prove that G is an odd cycle if and only ifa(G) = (n(G) — 1)/2 and a(G — u — 
v) = a(G) for all u, v € V(G). (Melnikov—Vizing [1971], Greenwell [1978]) 


8.1.9. Let v,,...,v, be a simplicial elimination ordering of G, and let Q(v;) = {v, € 
N(v;): j > i}. Note that Q(v;) is the clique of neighbors of v; at the time when »; is 
deleted in the elimination ordering. Let 5 = {y,..., ¥,} be the stable set obtained 
“greedily” from the ordering 1i,..., v,; that is, set yj = v1, discard N(y,) from the re- 
mainder of the ordering, and proceed iteratively, at each step adding the least remaining 
element x to the stable set and discarding what remains of Q(x). 

a) Prove that applying the greedy coloring algorithm to the construction order- 
ing v,,...,¥; yields an optimal coloring and that w(G) = 1 + max)? yg) |Q()I. 
(Fulkerson—Gross [1965]) 

b) Prove that S is a maximum stable set and that the sets {y;} U Q(y;) form a 
minimum clique covering. (Gavril [1972]) 


8.1.10. Add a test to the MCS algorithm to check whether the resulting ordering is a 
simplicial elimination ordering. (Tarjan—Yannakakis [1984]) 


8.1.11. Prove directly (without using a simplicial elimination ordering) that the inter- 
section graph of a family of subtrees of a tree has no chordless cycle. 
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8.1.12. (—) Prove that every graph is the intersection graph of a family of subtrees of 
some graph. 


8.1.13. Prove that every chordal graph has an intersection representation by subtrees 
of a host tree with maximum degree 3. 


8.1.14. Let Q be a maximal clique in a connected chordal graph G. For all x € V(G), 
prove that Q has two vertices whose distances from x are different. (Voloshin [1982]) 


8.1.15. Intersection graphs of subtrees of a graph. A fraternal orientation of a graph 
is an orientation such that any pair of vertices with a common successor are adjacent. 

a) (—) Prove that a graph is chordal if and only if it has an acyclic fraternal orien- 
tation. 

b) (—) Obtain a graph with no fraternal orientation. 

c) A family of trees in a graph is rootable if the trees can be assigned roots so that 
a pair of them intersects if and only if at least one of the two roots belongs to both sub- 
trees. Prove that G has a fraternal orientation if and only if G is the intersection graph 
of a rootable family of subtrees of some graph. (Gavril—Urrutia [1994]) 


8.1.16. (!) Prove that a simple graph G is a forest if and only if every pairwise intersect- 
ing family of paths in G has a common vertex. (Hint: For sufficiency, use induction on 
the number of paths in the family.) 


8.1.17. (!) Forbidden subgraph characterization of split graphs. A graph is a split 
graph if its vertices can be partitioned into a clique and a stable set. 

a) Prove that if G is a split graph, then G and G are chordal graphs. Observe that 
if G and G are chordal graphs, then G has no induced subgraph in {C4, 2K2. Cs}. 

b) Prove that if G is a simple graph with no induced subgraph in {C4. 2K. Cs}, then 
G is a split graph. (Hint: Among the maximum-sized cliques, let Q be one such that 
G — Q has the minimum number of edges.Prove that G — Q is a stable set, using the 
choice of Q and the forbidden subgraph conditions.) (Hammer—Simeone [1981]) 


8.1.18. Let d, > -.-- > d, be the degree sequence of a simple graph G, and let m be 
the largest value of k such that d, > k — 1. Prove that G is a split graph if and only if 
yo ai = m(m — 1) + -"_,, d:;. (Comment: Compare with Exercise 3.3.28.) (Hammer- 
Simeone [1981]) 


8.1.19. (—) Determine the trees that are split graphs, and construct a pair of noniso- 
morphic split graphs with the same degree sequence. 


8.1.20. The k-trees are the graphs that arise from a k-clique by 0 or more iterations of 
adding a new vertex joined to a k-clique in the old graph. Prove that G is a k-tree if and 
only if G satisfies the following three properties: 

1) G is connected. 

2) G has a k-clique but no k + 2-clique. 

3) Every minimal vertex separator of G is a k-clique. 


8.1.21. Let G be an n-vertex chordal graph having no clique of order k + 2. Prove that 
e(G) < kn — (*5'), with equality if and only if G is a k-tree. 


8.1.22. (+) Generalize Theorem 2.2.3 (Cayley’s Formula) by proving that the number 
of k-trees with vertex set [n] is (i)tk(n — k) + 1)"-*-?. (Hint: Generalize the Priifer code 
for rooted trees, which generates a list with n — 1 entries and never deletes the root. 
In a k-tree, the vertices belonging to exactly one k + 1-clique are the leaves. A k-tree 
can be grown using any k-clique as a root. The lists generated from k-trees with a fixed 
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root have as symbols 0 and pairs ij, where i comes from some k-set and j from some 
n — k-set.) (Greene—Iba [1975]; other proofs in Beineke—Pippert [1969], Moon [1969]) 


8.1.23. Suppose that G is a chordal graph with w(G) = r. Prove that G has at most 
(5) + (3) (n —r) cliques of size j, with equality (for all j simultaneously) if and only if 
G is an r — 1-tree. 


8.1.24. The Helly property of the real line. Suppose that /;,..., /, are pairwise inter- 
secting real intervals. Prove that /;,..., J, have a common point. 


8.1.25. Prove directly that a tree is an interval graph if and only if it is a caterpillar (a 
tree having a path that contains at least one vertex of each edge). 


8.1.26. (!) Let G be an interval graph. Prove that G is a comparability graph and that 
G is a chordal graph. (Hint: Establish a simplicial elimination ordering.) 


8.1.27. Prove that a graph G has an interval representation if and only if the clique- 
vertex incidence matrix of G has the consecutive 1s property. 


8.1.28. Prove that G is an interval graph if and only if the vertices of G can be ordered 
Vj,..., U, such that v; < vy implies v; < v, whenever i < j < k. (Jacobson—McMorris— 
Mulder [1991], for example) 


8.1.29. An asteroidal triple in a graph is a triple of vertices x, y, z such that between 
any two there exists a path avoiding the neighborhood of the third. Prove that no aster- 
oidal triple occurs in an interval graph. (Comment: Interval graphs are precisely the 
chordal graphs that have no asteroidal triples) (Lekkerkerker—Boland [1962])) 


x x 
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8.1.30. Six professors visited the library on the day the rare book was stolen. Each 
entered once, stayed for some time, and then left. For any two of them that were in the 
library at the same time, at least one of them saw the other. Detectives questioned the 
professors and gathered the following testimony: 


PROFESSOR CLAIMED TO HAVE SEEN 


Abe Burt, Eddie 
Burt Abe, Ida 
Charlotte Desmond, Ida 
Desmond Abe, Ida 
Eddie Burt, Charlotte 
Ida Charlotte, Eddie 


In this situation, “lying” means providing false information, not omitting information. 
Assume that the culprit tried to frame another suspect by lying. If one professor lied, 
who was it? (Golumbic [1980, p20]) 


8.1.31. (+) Prove that G is a unit interval graph (representable by intervals of the sare 
length) if and only if A(G) + J has the consecutive 1s property. (Roberts [1968]) 
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8.1.32. (+) Prove that G is a proper interval graph (representable by intervals such 
that none properly contains another) if and only if the clique-vertex incidence matrix of 
G has the consecutive 1s property for both rows and columns. (Fishburn [1985] 


8.1.33. (—) Prove that every P,-free graph is a Meyniel graph. 
8.1.34. (!) Prove that every chordal graph is o-triangulated. 


8.1.35. Let C be an odd cycle in a graph with no induced odd cycle. Prove that V(C) has 
three pairwise-adjacent vertices such that paths joining them in C all have odd length. 


8.1.36. (+) Prove that the conditions below are equivalent. 

A) Every odd cycle of length at least 5 has a crossing pair of chords. 

B) For every pair x, y € V(G), chordless x, y-paths are all even or all odd. 
(Hint: For A => B, consider a pair P;, P2 of x, y-paths with opposite parity such that the 
sum of their lengths is minimal.) (Burlet-Uhry [1984]) 


8.1.37. Prove that every perfectly orderable graph is strongly perfect. (Hint: Use 
Lemma 8.1.25) (Chvatal [1984]) 


8.1.38. (!) Prove that the graphs below are strongly perfect but not perfectly orderable. 


e/math_books 


8.1.39. (—) Prove that the graph on the left above is a Meyniel graph but is not perfectly 
orderable. Prove that the graph Ps is perfectly orderable but is not a Meyniel graph. 


8.1.40. (!) Weakly chordal graphs. 
a) Prove that every chordal graph is weakly chordal. 
b) Prove that the graph below is weakly chordal but not strongly perfect. 


TX 


8.1.41. (—) A skew partition of G is a partition of V(G) into two nonempty sets X, Y 
such that G[X] is disconnected and G[Y] is disconnected. Chvatal [1985b] conjectured 
that no minimal imperfect graph has a skew partition. Prove that this implies the 
Star-Cutset Lemma and is implied by the SPGC. 


8.1.42. Prove that the 10-vertex graph in Example 8.1.37 is 3, 3-partitionable. (Chvatal— 
Graham—Perold—Whitesides [1979]) 


8.1.43. (—) Let x and v be vertices of a partitionable graph G. Prove that if x # v, 
then every maximum clique containing x consists of one vertex from each stable set that 
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is the mate of a clique containing v. State the complementary assertion when x < v. 
(Buckingham—Golumbic [1983]) 


8.1.44. (+) Prove that no p-critical graph has antitwins, which are a pair of vertices 
such that every other vertex is adjacent to exactly one of them. (Hint: Given a p-critical 
graph with antitwins {x, y}, let S be the stable set containing y in the unique optimal 
coloring of G — x. Find among the vertices of the w — 1-colorable subgraph G — x — S an 
w — 1 clique in N(x) that doesn’t extend into N(y). Similarly, find a stable set in N(y) 
that doesn’t extend into N(x). Now build an induced 5-cycle.) (Note: The partitionable 
graph of Example 8.1.37 has antitwins.) (Olariu [1988]) 


8.1.45. Vertices x, y form an even pair if every chordless x, y-path has even length 
(number of edges). Twins (nonadjacent vertices with the same neighborhood) are a 
special case. 

a) Suppose that $,, Sy are maximum stable sets in a partitionable graph G. Prove 
that the subgraph of G induced by the symmetric difference of S; and Sz is connected. 
(Bland—Huang-Trotter [1979]) 

b) Use part (a) to prove that no p-critical graph has an even pair. (Comment: Hence 
no p-critical graph has twins, which proves yet again that vertex duplication preserves 
perfection.) (Meyniel [1987], Bertschi—Reed [1988]) 


8.1.46. Let G be a partitionable graph, and let 5, Sz be stable sets in the optimal col- 
oring of G — x. Use part (a) of the preceding problem to prove that the subgraph of G 
induced by S) U Sp, U {x} is 2-connected. (Buckingham—Golumbic [1983]) 


8.1.47. Prove that one graph below is a circle graph but not a circular-arc graph, and 
prove that the other is a circular-arc graph but not a circle graph. 
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8.1.48. (!) The graph K;3 + is the 4-vertex graph obtained by adding one edge to K, 3. 
Using the perfection of Meyniel graphs, prove that K,,3 +e-free graphs satisfy the SPGC. 
(Meyniel [1976]) 


8.1.49. Let G = Ce Let S = {vew, 01, Vw, Vwr2} U {¥jy41: 2 < i < a — 1), and let T = 
{U(q—1)w41s Vaws V1, Vw} U {Uy4i: 2 < i < w— 1}. Prove that S intersects every maximum 


clique of G and that T intersects every maximum stable set of G. (Chvatal [1976]) 


8.1.50. (!) SPGC for circle graphs. (Buckingham—Golumbic [1983]) 

a) Use Lemma 8.1.28 to prove that if x is a vertex in a partitionable graph G, then 
G — N[x] is connected, where N[x] = N(x) U {x}. 

b) Use part (a) to prove that partitionable circle graphs are K,.3-free. 

c) Conclude from part (b) and Corollary 8.1.53 that the SPGC holds for circle graphs. 
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8.2. Matroids 


Many results of graph theory extend or simplify in the theory of matroids. 
These include the greedy algorithm for minimum spanning trees, the strong 
duality between maximum matching and minimum vertex cover in bipartite 
graphs, and the geometric duality relating planar graphs and their duals. 

Matroids arise in many contexts but are special enough to have rich combi- 
natorial structure. When a result from graph theory generalizes to matroids, it 
can then be interpreted in other special cases. Several difficult theorems about 
graphs have found easier proofs using matroids. 

Matroids were introduced by Whitney [1935] to study planarity and alge- 
braic aspects of graphs, by MacLane [1936] to study geometric lattices, and by 
van der Waerden [1937] to study independence in vector spaces. Most of the 
language comes from these contexts. Here we emphasize applications to graphs. 


HEREDITARY SYSTEMS AND EXAMPLES 


ut 


In many mathematical contexts, we study sets that avoid conflicts; often 
this is called “independence”. Inherent in this notion is that subsets of inde- 
pendent sets are independent, and the empty set is independent. 


1 


8.2.1. Example. Acyclic sets of edges. Let E be the edge set of a graph G, 
and let. X C E be “independent” if it contains no cycle. Every subset of an 
independent set is independent, and the empty set is independent. The cycles 
are the minimal dependent sets. 

Consider the kite K,—e, which has five edges. Since spanning trees of this 
graph have three edges, every set having more than three edges is dependent. 
Also the two triangles are dependent; this yields eight dependent sets and 24 
independent sets among the subsets of E. There are three minimal dependent 
sets (the cycles) and eight maximal independent sets (the spanning trees). & 


8.2.2. Definition. A hereditary family or ideal is a collection of sets, F, such 
that every subset of a set in F is aiso in F. A hereditary system M on 
E consists of a nonempty ideal Iy of subsets of E and the various ways of 
specifying that ideal, called aspects of M. 

The elements of Iy are the independent sets of M. The other sub- 
sets of E (comprising the family Dy) are dependent. The bases are the 
maximal independent sets, and the circuits are the minimal dependent 
sets; By and Cy denote these families of subsets of E. 

The rank of a subset of E is the maximum size of an independent set 
in it. The rank function ry is defined by r(X) = max{|Y|: Y c X, Ye}. 
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8.2.3. Example. Hereditary systems. Label each vertex a = (a1,...,da,) of the 
hypercube Q,, by the corresponding set X, = {i : a; = 1}. Draw Q, in the plane 
so that the vertical coordinates of vertices are in order by the size of the sets 
labeling them. 

The diagram below illustrates the relationships among the independent 
sets, bases, circuits, and dependent sets of a hereditary system. The bases are 
the maximal elements of the family I and the circuits are the minimal elements 
not in I. In every hereditary system, @ belongs to I. If every set is independent, 
then there is no circuit, but there is always at least one base. 

In the example on the right, the independent sets are the acyclic edge sets 
in a graph with three edges. The only dependent sets are {1, 2} and (1, 2, 3}, 
the only circuit is {1,2}, and the bases are {1, 3} and {2,3}. The rank of an 
independent set is its size. For the dependent sets, we have r({1, 2}) = 1 and 
r({1, 2, 3}) = 2. | 


4) 
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8.2.4. Remark. Aspects of hereditary systems. A hereditary system M is de- 
termined by any of Iy, By, Cy, rm, et¢., because each aspect specifies the 
others. We have expressed By, Cy, ry in terms of Iy. Conversely, if we know 
By, then Iy consists of the sets contained in members of By. If we know Cy, 
then Iy, consists of the sets containing no member of Cy. If we know ry, then 
Iy ={X C E: ry(X) = |X}}. a 


Hereditary systems are too general to behave nicely. We restrict our at- 
tention to hereditary systems having an additional property, and these we call 
matroids. We can translate any restriction on Iy into a corresponding re- 
striction on some other aspect of the hereditary system. Because hereditary 
systems can be specified in many ways, we have many equivalent definitions of 
matroids. Using various motivating examples, we state several of these prop- 
erties that characterize matroids. Later we prove that they are equivalent. We 
begin with the fundamental example from graphs. 


8.2.5. Definition. The cycle matroid M(G) of a graph G is the hereditary 
system on E(G) whose circuits are the cycles of G. A hereditary system 
that is M(G) for some graph G is a graphic matroid. 


Section 8.2: Matroids 351 


8.2.6. Example. Bases in cycle matroids. The bases of the cycle matroid M(G) 
are the edge sets of the maximal forests in G. Each maximal forest contains 
a spanning tree from each component, so they have the same size. Consider 
B,, Bz € Bwithe € B, — Bo. Deleting e from B, disconnects some component 
of B,; since Bz contains a tree spanning that component of G, some edge f € 
Bz — B, can be added to B, — e to reconnect it. 

For a hereditary system M, the base exchange property is: if B,, Bz € 
By, then for all e € B, — Bo there exists f € Bz — B, such that B} —e+ f € By. 
Matroids are the hereditary systems satisfying the base exchange property. @ 


8.2.7. Remark. In this subject, we often discuss inclusion and omission of sin- 
gle elements from sets. For symmetry and simplicity, we use the symbols + and 
— instead of U and — for this, and we drop the set brackets on 1-element sets. # 


8.2.8. Example. Rank function in cycle matroids. Let G be a graph with n 
vertices. For X C E(G), let Gy denote the spanning subgraph of G with edge 
set X. In M(G), an independent subset of X is the edge set of a forest in Gy. 
When Gy has k components, the maximum size of such a forest is n — k. Hence 
r(X) =n—k. Below we show such a forest Y (bold) within X (bold and solid). 

Ifr(X +e) = r(X) for some e € E —X, then the endpoints of e lie in a single 
component of Gy; adding e does not combine components. If we add two such 
edges, then again we do not combine components. Therefore, r(X) =r(X +e) = 
r(X + f) implies r(X) = r(X +e+ f). = 

For a hereditary system M on E, the (weak) absorption property is: if 
X C Eande, f € E, thenr(X) = r(X+e) = = r(X+/f) implies r(X+e+ f) =r(X). 
Matroids are the hereditary systems satisfying the absorption property (name 
suggested by A. Kézdy). | 


>/mat 
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Graphs may have loops and multiple edges. In cycle matroids, they lead to 
circuits of sizes 1 and 2. We use these terms for hereditary systems in general. 


8.2.9. Definition. In a hereditary system, a loop is an element forming a cir- 
cuit of size 1. Parallel elements are distinct non-loops forming a circuit of 
size 2. A hereditary system is simple if it has no loops or parallel elements. 


8.2.10. Definition. The vectorial matroid on a set FE cf vectors in a vector 
space is the hereditary system whose independent sets are the linearly 
independent subsets of vectors in E. A matroid expressible in this way is 
a linear matroid (or representable matroid). The column matroid 
M(A) of a matrix A is the vectorial matroid defined on its columns. 
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8.2.11. Example. Circuits in vectorial matroids. The set E may have repeated 
vectors; these would be parallel elements. The circuits are the minimal sets 
{x1,...,%%} C E such that >> c;x; = 0 using coefficients not all zero. Minimality 
forces all c; 4 0. 

Let C1, C2 be distinct circuits containing x. Using the equations of de- 
pendence for C; and C2, we can write x as a linear combination in terms of 
C, — x and in terms of C2 — x. Equating these expressions yields an equation 
of dependence for C; U Co — x; thus C 1 UC2g — x contains a circuit. 

For a hereditary system M on E, the (weak) elimination property is: 
whenever C1, Co are distinct circuits and x € C; M Co, another member of Cy 
is contained in C,; U C2 — x. Matroids are the hereditary systems satisfying the 
weak elimination property. 

The column matroid of the matrix below is also the cycle matroid M(K4—e). 


00011 
(0 1 1 0 1) a 
110 0 0 


8.2.12. Definition. The transversal matroid induced by sets A1,..., Am with 
union E is the hereditary system on E whose independent sets are the 
systems of distinct representatives-of subsets of {A1,..., Am}. Equivalently, 
letting G be the E, [m]-bigraph defined by e < i if and only if e € A;, the 
independent sets are the subsets of FE that are saturated by matchings in G. 


8.2.13. Example. Independent sets in‘transversal matroids. When M, M’ are 
matchings in G and |M ‘| > |M|, the symmetric difference MaM’ contains an 
M-augmenting path P (Theorem 3.1.10). Replacing MN P with M’N P yields a 
matching of size |M|+1 that saturates all vertices of M plus the endpoints of P. 

Consider independent sets /;, J; in the transversal matroid generated by 
Ai,..., Am. In the associated bipartite graph, let M,, Mz be matchings satu- 
rating /,, J2, respectively (on the left below, M, is solid and M2 is dashed). If 
[J2| > |;|, then the matching obtained from M, by using an M;-augmenting 
path in M2aM, saturates /; plus an element e € Jz — 1;; this “augments” Jo. 

For a hereditary system on £, the augmentation property is: for distinct 
11, Ig € I with |Jg| > |J4|, there exists e € J, — 1, such that J, U {e} e I. Matroids 
are the hereditary systems satisfying the augmentation property. 

The transversal matroid of the family A = {{1, 2}, {2, 3, 4}, (4, 5}}, illus- 


trated by the bipartite graph on the right, is again M(K4 — e). a 
Nh hhh se 12 3 4 65 
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The name “transversal matroid” arises from the use of “transversal” in 
systems of distinct representatives. An SDR for a subset of {A1,...,A,,} is a 
partial transversal for the full system. The independent sets of the transver- 
sal matroid on |_) A; are the partial transversals of {A1,..., A,,}. That these are 
matroids was discovered by Edmonds and Fulkerson [1965] and independently 
by Mirsky and Perfect [1967], who extended the result to infinite sets. 

Every matroid must satisfy all properties of matroids. Once we show that 
the properties defined above are equivalent for hereditary systems, we need 
only verify one to use all. First we check that they all hold for cycle matroids. 


8.2.14. Example. Augmentation in cycle matroids. Consider /;, Jz € Inc). As 
in Example 8.2.8, the spanning subgraph G,, has k = n — |/,| components, and 
its largest forest has n — k = |J,| edges. Therefore, the forest J. has some edge 
with endpoints in two components of G;,. This edge can be added to /; to obtain 
a larger independent set. Hence the augmentation property holds. a 


8.2.15. Example. Weak elimination in cycle matroids. The circuits of M(G) are 
the edge sets of cycles of G. Cycles have even degree at each vertex. If C1, C2 € 
C, then the symmetric difference C;aC2 also has even degree at each vertex. If 
C; # C2, this implies that C,;aC2 contains a cycle (see Proposition 1.2.27). This 
is stronger than the weak elimination property, since CyaCg C Cy UC2g — x. In 
the figure below, C; and C2 are face boundaries of length 9 sharing the dashed 
edges, and C,Cz is the union of two disjoint cycles. | 


For transversal matroids, the base exchange property is similar to the aug- 
mentation property; Exercise 9 considers the weak elimination property. For 
linear matroids, directly verifying the augmentation or base exchange property 
requires the algebraic result that k linearly independent vectors cannot all be 
expressed as linear combinations of a smaller set. Instead, we can use Theo- 
rem 8.2.20. Since the weak elimination property holds for independent sets of 
vectors, many theorems of linear algebra follow from Theorem 8.2.20! 


8.2.16. Remark. Notational conventions: Boldface I, B, C for families of sub- 
sets of E allows J <I, B € B, C € C to denote members of the families. Roman 
letters I, B, C, R denote properties that yield matroids. We use e, f,x, y as 
elements of E, and we use X, Y, F as subsets of E. | 


Every hereditary family is the collection of independent sets of a hereditary 
system. A collection B is realizable as the set of bases of a hereditary system if 
and only if B is nonempty and no element of B contains another. A collection 
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C is realizable as the set of circuits of a hereditary system if and only if the 
elements of C are nonempty and no element of C contains another. 

The characterization of rank functions is more subtle. It includes two prop- 
erties (r1, r2 below) that we will need, plus an additional technical condition 
that forces r to be the rank function of the hereditary system M defined by 
Iy ={X © Ev r(X) = |X]}. 


8.2.17. Lemma. For the rank function r of a hereditary system on E, 
(r1) r(@) = O. 
(r2) r(X) < r(X +e) <r(X)+1 whenever X ¢ Eandee E. 


Proof: From the definition r(X) = max{|Y|: Y C X.Y € Ij, we have r(@) = 0. 
Because X + e contains every independent subset of X, also r(X + e) > r(X). 
Because the independent subsets of X + e not contained in X consist of e plus 
an independent subset of X, we have r(X +e) <r(X)+1. | 


PROPERTIES OF MATROIDS 


We have remarked that many equivalent conditions on hereditary systems 
yield matroids. We can show that a hereditary system is a matroid by verifying 
any of them, after which we can employ them all without additional proof. We 
obtained the same benefit from equivalent characterizations of trees. 

Adding an edge to a forest creates at most one cycle. More generally, adding 
one element to an independent set in a matroid creates at most one circuit. Our 
proof of the greedy algorithm for spanning trees (Theorem 2.3.3) used only this 
property of graphs. This “induced circuit” property is one of the conditions that 
characterize matroids, as is the effectiveness of the greedy algorithm itself! 
Both properties appear in our list. 

Given weights on the elements of a matroid, the greedy algorithm is the 
process of iteratively including an element of largest nonnegative weight whose 
addition to the independent set already selected yields a larger independent 
set. Rado [1957] proved that matroids are precisely the hereditary systems 
for which the greedy algorithm selects a maximum-weighted independent set 
regardless of the choice of weights. 


8.2.18. Definition. A hereditary system M on E is a matroid if it satisfies 
any of the following additional properties, where I, B,C, and r are the 
independent sets, bases, circuits, and rank function of M. 

I: augmentation—if J), Jz € Iwith|J2| > ||, then /,;+e € Iforsomee € ,p—-]h. 

U: uniformity—for every X C E, the maximal subsets of X belonging to I have 
the same size. 

B: base exchange—if B,, B) € B, then for all e € B, — Bz there exists f € 
By — By such that B, —e+f eB. 

R: submodularity—r(X NY) +r(X UY) <r(X)+r(Y) whenever X.Y C E. 

A: weak absorption—r(X) = r(X +e) =r(X + f) implies r(X¥ + e+ f) =r(X) 
whenever X C E ande, f € E, 
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A’: strong absorption—if X,Y C E, and r(X + e) = r(X) for all e € Y, then 
r(X UY) =r(X). 

C: weak elimination—for distinct circuits C;, Co € C and x € C,; N Co, there 
is another meniber of C contained in (C, U Co) — x. 

J: induced circuits—if / <€ I, then / + e contains at most one circuit. 

G: greedy algorithm—for each nonnegative weight function on E, the greedy 
algorithm selects an independent set of maximum total weight. 


The base exchange property implies that all bases have the same size: if 
|B;| < |Bo| for some B,, Bo € B, then we can iteratively replace elements of 
B, — Bz by elements of Bz — B; to obtain a base of size |B1| contained in Bo, but 
no base is contained in another. 


8.2.19.* Remark. The rank of a set X C E in a vectorial matroid is the dimen- 
sion of the space spanned by X. Hence for vectorial matroids the submodularity 
inequality says that dim UNV+dimU@V < dimU+dim V, where U, V, U@V 
are the spaces spanned by subsets X, Y, X UY of E, respectively. The usual proof 
of this is the vector space statement of our proof of U=>R below. Exercise 10 
obtains submodularity directly for cycle matroids. 

Various of these properties (together with requirements for a hereditary 
system) have been used as the defining condition for matroids. Examples in- 
clude I (Welsh [1976], Schrijver [to appear]), U (Edmonds [1965b,c], Bixby 
[1981], Nemhauser—Wolsey [1988]), A (Whitney [1935]), C (Tutte [1970]), G 
(Papadimitriou-Steiglitz [1982]), and others (van der Waerden [1937], Rota 
[1964], Crapo—Rota [1970], Aigner [1979]). | 


Many authors include basic properties of hereditary systems in the set of 
axioms characterizing some aspect of a matroid. This can distract from the 
special additional properties of matroids and lead to extra work. Starting with 
hereditary systems yields more concise proofs. All properties of hereditary 
systems are always available. 


8.2.20. Theorem. For a hereditary system M, the conditions defining matroids 
in Definition 8.2.18 are equivalent. 


Proof: U = B. By uniformity for X = E, all bases have the same size. We then 
apply uniformity to the set (B; — e) U Bz. This yields an augmentation of the 
independent set B; — e from Bg to reach size | Ba]. 

B= I. Given independent sets /;, Jz € I with |J2| > |Ji}, choose B,, Bz € B 
such that [; C By, Jz C Bg. We use base exchange to replace elements of B; — 1; 
outside Bz with elements of By. Hence we may assume that B; — J; C Bo. If 
By, — lh C Be — Ie, then |B,| < |Bg|, which is forbidden by the base exchange 
property as remarked above. Hence /, has an element in B, — J,, and we use 
such an element to augment /. 

I> A. Suppose that r(X) =r(X+e) =r(X+f). Ifr(X+e+f) > r(X), then 
let /;, Jz be maximum independent subsets of X and of X +e+ f. Now |Jg| > |/1I, 
and we can augment /, from J2. Since J; is a maximum independent subset of 
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X, the augmentation can only add e or f, which contradicts the hypothesis that 
r(X)=r(X t+e)=r(X4 f). 

A => A’. We use induction on |Y — X|._ The statement is trivial when 
|Y — X| = 1. When |Y — X| > 1, choosee, f ¢ Y-—X, andlet Y’'= Y —e—‘f. Ap- 
plying the induction hypothesis to proper subsets of Y yields r(X) = r(X UY’) = 
r(X UY'+e)=r(X UY’ + f). Now weak absorption yields r(X) = r(X UY). 

A’ > U. If Y is a maximal independent subset of X, then r(Y +e) = r(Y) for 
all e ¢ X — Y. By strong absorption, r(X) = r(Y) = |Y|. Hence all such Y have 
the same size. 

U=R. Given X, Y C E, choose a maximum independent set /; from X NY. 
By uniformity, /; can be enlarged to a maximum independent subset of X U Y; 
call this J. Consider Jz X and I. NY, these are independent subsets of X and 
Y, and each includes /;. Hence 


rXNY)+r(X UY) = |hl+ lel = len Xl) + |b NY| sr(X)+r(Y¥). 


J=>G 


R= C. Consider distinct circuits C}, C2 « C with x € C1 C2. We have 
r(Cy) = |Ci| — 1 and r(Cg) = |C9| — 1. Also r(Cy N Ce) = |C1 N Cel, since every 
proper subset of a circuit is independent. If (C; U C2) — x does not contain a 
circuit, then r((CyUC2) —x) = |Cy U Ce|+1, and hence r(C,UC2) >.|Cy U C2|—1. 
Applying submodularity to C; and C2 yields the contradiction 


ICy AN Cel + 1Cy U Ce] — 1 < |Ci| + [Cel — 2. 


C= J. If ] + e contains C;, C2 € C for some / € I, then Cy, C2 both contain 
e. Now weak elimination guarantees a circuit in (C; U C2) — e. On the other 
hand, (C; U C2) — e is independent, being contained in J. 

J => G. For weight function w, let / be the output of the greedy algorithm. 
Among the maximum-weight independent sets, let /* be one having largest 
intersection with /. The algorithm cannot end with / c /*. If J + 7”, then let 
e be the first element of / — /* chosen by the algorithm. By the choice of /*, 
I* +e is dependent; hence it has a unique circuit C. Since C ¢ /, we may. choose 
f ¢€ C—I. Since /* + e has no other circuit, /* +e — f € I. The optimality 
of [* yields w(f) > w(e). Since f and the elements of J chosen earlier than 
e all lie in /*, f does not complete a circuit with them. Thus f was available 
when the algorithm selected e, which yields w(f) < w(e). Now w(f) = w(e) 
and w(l* +e— f) = w(/*). With |J* +e — fNI| > |J* N1|, this contradicts the 
choice of /*. Thus /* = J. 

G= I. Given J;, lz € I with k = || < |/o|, we design a weight function 
for which the success of the greedy algorithm yields the desired augmentation. 
Let w(e) =k +2 fore € 1, and let w(e) =k +1 fore e h-hh. Let we) = 
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0 fore ¢ } Ulh. Now w(lg) > (k +1)? > k(kK +2) = wh), so ; is not a 
maximum-weighted independent set. However, the greedy algorithm chooses 
every element of J; before any element of Jz — /;. Because it finds a maximum- 
weighted independent set, it continues after absorbing /, and adds an element 
e€ Ig — 1, suchthat J, +e eI. | 


The property most often used to show that a hereditary system is a matroid 
is the augmentation property. 


8.2.21. Example. The uniform matroid of rank k, denoted U;.,, when |E| =n, 
is defined by I = {X C E: |X| < k}. This immediately satisfies the base ex- 
change and augmentation properties. The free matroid is the uniform ma- 
troid of rank |E|. Uniform matroids are used in building more interesting 
matroids and in characterizing classes of matroids. Few uniform matroids are 
graphic, and few graphic matroids are uniform (Exercise 6). Neither M(K4 —e) 
nor M(K,z) is a uniform matroid. 

A linear matroid representable over the field Zz or Z3 is binary or ternary, 
respectively. Every graphic matroid is binary (Exercise 43); Us4 is ternary 
(Exercise 44) but not binary (and hence not graphic). a 


8.2.22. Example. The partition matroid on E induced by a partition of E into 
blocks E;,..., E, is defined by I= {X € E: |X NE;,| < 1 for all i}. Since @ € I, 
and since X € I whenits elements lie in distinct blocks, Tis a hereditary family. 
Given J}, J, € I with |J2| > |J,|, the set J, must intersect more blocks than /;; 
an element of J, in a block that J; misses yields the desired augmentation of /;. 
Alternatively, r(X) is the number of blocks having elements in X; this satisfies 
the absorption property. (Note: M(K,4 — e) is not a partition matroid.) 

Given a U, V-bigraph G, the incidences with U = u,..., u, define a parti- 
tion matroid on E(G) (this differs from the transversal matroid on U induced 
by G). The blocks are the sets E; = {e € E(G): u; € e}. Aset X C E(G) isa 
matching in G if and only if X is independent in the partition matroid induced 
by U and in the partition matroid induced by V. This is the motivation for our 
later discussion of matroid intersection. 


Ey E2 Eg. E, Es 


Fy Fo F3 Fy Fs 


When G has an odd cycle, G has no set of vertices whose incident sets 
partition E(G). In a digraph, however, each edge has a head and a tail, and we 
can define the head partition matroid and thé tail partition matroid using 
the edge partitions induced by incidences with heads and by incidences with 
tails. (Example: The matroid of Example 8.2.3 arises as the partition matroid 
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on E induced by U in the bipartite graph below, as the head partition matroid 
in the first digraph, and as the tail partition matroid in the second digraph.) & 


THE SPAN FUNCTION 


We next introduce several additional aspects of hereditary systems and 
matroid properties involving them. We use these aspects to illuminate matroid 
duality, which will lead to a characterization of planar graphs using matroids. 

The algebraic concept of the space “spanned” by a set of vectors extends to 
hereditary systems. The definition is suggested by cycle matroids; a set spans 
itself and the elements that complete circuits with its subsets. 


8.2.23. Definition. The span function of a hereditary system M is the func- 
tion oy on the subsets of E defined by oy (X) = X U{e € E: Y +e € Cy for 
some Y C X}. Ife € o(X), then X spans e. 


In a hereditary system, X is a dependent set if and only if it contains a 
circuit, which by Definition 8.2.23 holds if and only if e € o(X — e) for some 
e € X. We can therefore find the independent sets from the span function via 
I= {X C E:(e € X) => (e ¢ o(X — e))}. The properties of span functions 
that we use in studying matroids are (s1, s2, s3) below (an additional technical 
condition is needed to characterize the span functions of hereditary systems). 
- First we illustrate property (s3) using graphs. 


8.2.24. Example. In the cycle matroid M(G), the meaning of e ¢ o(X) is that 
X has no path between the endpoints of e. If also e € o(X + f), then adding 
f completes such a path. The path completes a cycle with e, and hence also 
f €o0(X +e). In the figure below, X consists of the four bold edges. w 


8.2.25. Proposition. Ifo is the span function of a hereditary system on E, and 
X,Y C E, then the following properties hold. 
sl) X Co(X) (co is expansive). 
s2) Y C X implies o(Y) C o(X) (o is order-preserving). 
s3)e ¢o(X) ande €0(X + f) imply f € o(X +e) (Steinitz exchange). 


Section 8.2: Matroids 359 


Proof: Definition 8.2.23 implies immediately that o is expansive and order- 
preserving. Ife € o(X + f), then e belongs to a circuit C in X + f +e. Ifalso 
e ¢ o(X), then f € C. This circuit yields f € o(X +e), and hence o satisfies 
the Steinitz exchange property. | 


Properties of the span function lead to a short proof of a stronger form of 
the elimination property. The weak elimination property states that when e € 
- CyNCo, there is a circuit in (Cj; UC2)—e. Cycle matroids have the much stronger 
property that C,aC2 is an edge-disjoint union of cycles, since every vertex in 
C ,4C2 has even degree. General matroids have the intermediate property that 
all elements of the symmetric difference belong to cycles in (C, U C2) — e when 
e € Cy M C2 (Property C’ below). 

We need a property relating rank and span in hereditary systems. The 
truth of the converse is our next characterization of matroids. 


8.2.26.* Lemma. In a hereditary system, [r(X + e) = r(X)] > e € a (X). 


Proof: Let Y be a maximum independent subset of X. Since |Y| = r(X) = 
r(X +e), also Y is a maximum independent subset of X + e. Hence e completes 
a circuit with some subset of X contained in Y, and e € o(X). | 


8.2.27.* Theorem. If M is a hereditary system, then each condition below is 
necessary and sufficient for M to be a matroid. 

P: incorporation—r (o(X )) = r(X) for all XCE. 

S: idempotence—o*(X) = 0 (X) for all XCE. 

T: transitivity of dependence—if e E o(X) and X Ca(Y), thene €o(Y). 

C’: strong elimination—whenever Ci, Co € C,e € Cy NCo, and f € CyaCo, 
there exists C € C such that fe CC Cy UC2 -e. 


Proof: U > P. Every element in o(X) — X completes a circuit with a subset of 
of X and thus lies in the span of every set between X and o(X). Thus it suffices 
to prove that r(Y +e) =r(Y) whene € o(Y). Let Z be a subset of Y such that 
Z+eeC,. Augment Z to a maximal independent subset / of Y +e, By the 
uniformity property, |/| = r(Y +e). Since Z+e € C, wehavee ¢ /. Thus/ CY, 
and we have r(Y) > |J| = r(Y +e). (Absorption can be used instead.) 

P= S. Since o is expansive, o7(X) > o(X), and we need only show that-e € 
o7(X) implies e € o(X). By the incorporation property, r(o(X) + e) = r(o(X)) 
and r(a(X)) = r(X). Since X C'o(X), monotonicity ofr yields r(X) < r(X +e) < 
r(o(X) +e) = r(X). Since equality holds throughout, Lemma 8.2.26 yields 
eca(X). 

S=> T. If X C o(Y), then the order-preserving and idempotence properties 
of o imply 0 (X) C o2(Y) =a(Y). 

T= C’. Given distinct C,, Co €e Cwithe € CyNC2 and f € C:—Co2, we want 
f €o0(Y), where Y =)C; UC2) —e — f. We have f € o(X), where X = C; — f. 
By T, it suffices to show X Co (Y). Since X — e C Y Ca (Y), we need only show 
e€o(Y). Since o is order-preserving, we have e € 0(C2 —e) Ca(Y). 

C’ >C. C is a less restrictive statement than C’. a 
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Like uniqueness of induced circuits (J), the incorporation property (P) re- 
lates two aspects of hereditary systems. These are well-known properties of 
matroids, and in the approach via hereditary systems they become characteri- 
zations. The equivalence of C and C’ was first proved by Lehman [1964]. 

Idempotence occurs naturally for graphic and linear matroids. The span 
of a set of vectors contains nothing additional in its span; similarly, every edge 
that can be added to the span of a set of edges joins two components. This 
suggests related aspects of hereditary systems. 


8.2.28. Definition. The spanning sets of a hereditary system on E are the 
sets X C E such that o(X) = E. The closed sets are the sets X C E such 
that o(X) = X (also called flats or subspaces). The hyperplanes are the 
maximal proper closed subsets of E. 


8.2.29.* Remark. The span function of a matroid is also called its closure 
function. A closure operator is an expansive, order-preserving, idempotent 
function from the family of subsets of a set to itself. A closure operator is the 
span function of a matroid if and only if it has the Steinitz exchange property. 
In every hereditary system, the span function satisfies Steinitz exchange. 
Thus treating matroids as hereditary systems with additional properties is not 
well suited for studying closure operators. The span function of a hereditary 
system M is a closure operator if and only if M is a matroid. Matroids are de- 
veloped from lattice theory in MacLane [1936], Rota [1964], and Aigner [1979]. 
We have not considered all relationships among aspects of matroids. Bry- 
lawski [1986] presents a matrix describing the transformations among about a 
dozen aspects of matroids, calling these maps eryptomorphisms. a 


THE DUAL OF A MATROID 


Duality in matroids generalizes the notion of duality for planar graphs. 
Every connected plane graph G has a natural dual graph G* such that (G*)* = 
G. The dual is formed by associating a vertex of G* with each face of G and 
including a dual edge e* in G* for each edge of G, such that the endpoints of the 
edge e* are the vertices for the faces on the two sides of e. 

A set of edges in a plane graph G forms a spanning tree in G if and only if 
the duals to the remaining edges form a spanning tree in G* (Exercise 6.1.21). 
Hence the bases in the cycle matroid M(G*) are the complements of the bases 
in M(G). We define duality for matroids and hereditary systems so that the 
properties of duality in planar graphs generalize. 


8.2.30. Definition. The dual of a hereditary system M on E is the hereditary 
system M* whose bases are the complements of the bases of M. The aspects 
B*(By:), C*, I*, r*, o*, of M* are the cobases, cocircuits, etc., of M. 

The supbases S of M are the sets containing a base. The hypobases 
H are the maximal subsets containing no base. We write X for E — X. 
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8.2.31. Lemma. If M is a hereditary system, then 

a) B* = {B: B € B} and (M*)* = M. 

b) It = (S: S € S} and St = (7: Je I}. 

c) C* = {H: H € H} and H* = {C: C € C}. 
Proof: The statement about B* is the definition of M*. It immediately yields 
(M*)* = M and both parts of (b). Also, X is a maximal (proper) subset of E 
containing no base (a hypobase of M) if and only if X is a minimal nonempty 
set contained in no cobase, which is a circuit of M*. Similarly, the hypobases of 
M* are the complements of the circuits of M. | 


We have chosen “supbase” and “hypobase” to share initials with “spanning” 
and “hyperplane”, because for matroids the spanning sets and supbases are the 
same, and the hyperplanes and hypobases are the same. 


8.2.32. Lemma. If M is a matroid, then the supbases are the spanning sets, 
and the hypobases are the hyperplanes. 


Proof: A set X is spanning if and only if o(X) = E. By the incorporation 
property, this is equivalent to r(X) = r(E). By the uniformity property, this is 
equivalent to X containing a base. For hyperplanes, see Exercise 32.) | 


books 


Consider B,, B, © E. If neither of B,, Bz contains the other, then also 
neither of B;, Bz contains the other. Therefore, the dual of a hereditary system 
is a hereditary system. The notion of duality becomes useful when we prove 
that the dual of a matroid is a matroid. (This follows easily from a dual version 
of the base exchange property. 


telegr 


8.2.33. Lemma. If M is.a matroid and B;, Bz € B, then for each e € B, — Bo 
there exists f € Bz such that By + e — f is a base. 


Proof: Since Bz is a base, Bz + e contains exactly one circuit C. Since B, is 
independent, C also contains an element f € Bz — B). Now B. +e — f contains 
no circuit and has size r(E). a 


8.2.34. Theorem. (Whitney [1935]) The dual of a matroid M on E is a matroid 
with rank function r*(X) = |X| — (r(E) — r(X)). 
Proof: We have observed that M* is a hereditary system; now we prove the base 
exchange property for M*. If Bi, By € B* ande € B, — Bo, then By, By € B, with 
e € By— By. By Lemma 8.2.33, there exists f € B; — By such that B}+e-—f € B. 
Now B; —e + f € B* is the desired exchange. 
To compute r*(X), let Y be a maximal coindependent subset of X, so r*(X) = 
r*(Y) = |Y|. By Lemma 8.2.31, Y is aminimal superset of X that contains a base 
of M. Since Y arises from X by augmenting a maximal independent subset of 
X to become a base, we have |Y| — |X| = r(Z) —r(X). With |Y| — |X| = |X|-IYI, 
this yields the desired formula 


r*(X) =|¥|] =|X| —({¥| - |X|) = 1X - (r(E) —r(X)). a 
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We can restate any matroid property using dual aspects. Exercises 33-34 
request characterizations of hyperplanes and closed sets by this method. More 
subtle results involve relationships between a matroid and its dual. 


8.2.35. Proposition. (Dual augmentation property) Let M be a matroid. If 
X €Tand X’ € I* are disjoint, then there are disjoint B € B and B’ € B* 
such that X C B and X’ C B’. 


Proof: Since X’ is coindependent in M, X’ is spanning in M . Hence every 
maximal independent subset of xX! is a base; we augment X C X’ to a base B 
contained in X’. The cobase B’ = B contains X’. a 


We will use cycle matroids to characterize planar graphs. The next result 
enables us to describe the cocircuits of a cycle matroid. 


8.2.36. Proposition. Cocircuits of a matroid are the minimal sets intersecting 
every base. Bases are the minimal sets intersecting every cocircuit. 


Proof: The cocircuits are the minimal sets contained in no cobase. Because 
the cobases are the complements »f the bases, a set is contained in no cobase if 
and only if it intersects every base. Similarly, the cobases are the maximal sets 
containing no cocircuit, so the complements of the cobases are the minimal sets 
intersecting every cocircuit. a 
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8.2.37. Corollary. The cocircuits of the cycle matroid M(G) are the bonds of G. 


Proof: By Proposition 8.2.36, the cocircuits are the minimal sets intersect- 
ing every maximal forest. Hence they are the minimal sets whose deletion 
increases the number of components; these are the bonds. | 


8.2.38. Definition. The bond matroid or cocycle matro?‘1 of a graph G is 
the hereditary system whose circuits are the bonds of G. 


By Corollary 8.2.37, the bond matroid of G is the the dual of the cycle 
matroid M(G). Weak elimination now applies to bonds. Since a cycle must 
return to its starting point, it cannot intersect a bond in exactly one edge. This 
generalizes to matroids as another characterization of cocircuits. 


8.2.39. Theorem. The cocircuits of a matroid M on £ are the minimal 
nonempty sets C* C E such that |C*N C| 4 1 for every C € C. 


Proof: To show that every cocircuit has this property, suppose that C ¢€ C, 
C* € C*, C*NC =e. Then C —e € Land C*—e € I’, and the dual augmentation 
property yields B € B and B € B* such that C —e C Band C* —e CB. Since e 
must eppear in B or B*, we obtain C ce lorC* el. 

For the converse, we show that every nonempty set in Ix meets some C € C 
in one element; since cocircuits do not, every minimal set that does not is a 
cocircuit. Choose X* € I*. Let B* be a cobase coniaining x, and let B = B*. 
For each e € X*, B + e contains a circuit C, and X* 7: C = {e}. a 


Section 8.2: Matroids 363 
MATROID MINORS AND PLANAR DUALS 


From a graph G we can obtain smaller graphs by repeatedly deleting and/or 
contracting edges. The resulting graphs are the minors of G. Wagner [1937] 
proved that G is planar if and only if it does not have Ks or K3.3 as a minor (Ex- 
ercise 6.2.12). Hadwiger [1943] conjectured that G is k-colorable if G has no 
minor isomorphic to K,,;. A simple graph is a forest if and only if it does not 
have C3 as a minor. 

To generalize these operations to matroids, we need to know how deletion 
and contraction affect cycle matroids. The acyclic subsets of E(G — e) are pre- 
cisely the acyclic subsets of E(G) that omit e. The acylic subsets of E(G - e) are 
the subsets of E(G) — e whose union with e is acyclic in G. A dual description of 
contraction is more convenient: X contains a spanning tree of each component 
of G - e if and only if X + e contains a spanning tree of each component of G. 

We also want the notation to extend in a natural way. This causes diffi- 
culty, because discussion of graph minors often emphasizes the edges removed, 
while discussion of matroid minors emphasizes the elements that remain. We 
compromise by using matroid notation for the matroid on the set that remains 
while extending graph notation to describe matroids obtained by deleting or 
contracting one element. 


ks 
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8.2.40. Definition. For a-hereditary system M on E, the restriction of M 
to F < E, denoted M|F and obtained by deleting F, is the hereditary 
system defined by Iyjrp = {X C F: X € Iy}. The contraction of M to 
F C E, denoted M.F and obtained by contracting F, is the hereditary 
system defined by Sy. = (X C F: X UF € Sy}. When F = E —e, we 
write M—e = M|F and M-e = M.F. The minors of M are the hereditary 
systems arising from M using deletions and contractions. 


The definitions imply that M|F and M.F are hereditary systems. The op- 
erations of restriction and contraction commute (Exercise 41). The definition 
of contraction via supbases yields a natural duality between these operations. 


8.2.41. Proposition. For hereditary systems, restriction and contraction are 
dual operations: (M.F)* = (M*|F) and (M|F)* = (M*.F). 
Proof: (yr = {X C F: F — X € Sy.r} ={X C F: (F — X)UF € Sy} 
={xX ¢ F: X €Suy={XC F: X €Iy-} = Iyer. 
For the second statement, apply the first to M* and take duals. | 


The duality between deletion and contraction is most intuitive for plane 
graphs. Deleting an edge e in a plane graph G contracts the corresponding dual 
edge in G*; contracting e deletes the edge in the dual. 


‘\ 7 
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8.2.42. Corollary. Under deletion or contraction of an edge e in a graph G, the 
cycle matroid and bond matroid behave as listed below. 


M(G—e)=M(G)-—e M*(G—e)=M*(G)-e 
M(G-e)= M(G)-e M*(G -e) = M*(G)-e 


Proof: Matroid deletion and contraction are defined so that the statements 
in the first column describe the behavior of cycle matroids. Using these and 
Proposition 8.2.41, we compute 


M*(G —e) =|M(G— e)]* = [M(G) —e]* = M*(G)-e, and 
M*(G-e) = [M(G.- e}}* = [M(G) - e]* = M*(G) —e. a 


As desired, restrictions and contractions of matroids are matroids. 


8.2.43. Theorem. Given F C E and a thatroid M on E, both M|F and M.F are 
matroids on F. In terms of ry, their rank functions are ryj-(X) = ru(X) 
and ru.F(X) = ryu(x U F) — ru(F). 


Proof: The augmentation property from M applies to any pair of sets in Iy,F; 
thus M|F satisfies the augmentation property and is a matroid. Using duality, 
M.F = (M*|F)* is also a matroid. The-rank function for M|F follows from the 
definition of Iy)-. This and repeated application of Theorem 8.2.34 to (M*|F)* 
yields the rank function for M.F (Exercise 42). a 

The formula for ry - yields a description of the independent sets: X € Iyr 
if and only if adding X to F increases the rank by |X|. 

A set of edges in a plane graph G forms a cycle if and only if the corre- 
sponding dual edges form a bond in G* (Theorem 6.1.14). Using the natural 
bijection between edges and dual edges, this tells us that the cycle matroid of a 
plane graph G is (isomorphic to) the bond matroid of G*. By Corollary 8.2.37, 
the bond matroid of a graph H is [M(H)]*. Applying this to G and to G* tells us 
that the bond matroid of G is (isomorphic to) the cycle matroid of G*. Thus the 
bond matroid of a planar graph G is graphic. Using Kuratowski’s Theorem, we 
will prove that this condition characterizes planarity. 

Whitney [1933a] approached this by defining a non-geometric nection of 
dual. Changing his definition slightly, we say that H is an abstract dual of 
G if there is a bijection ¢: E(G) — E(H) such that X C E(G) is a bond in G 
if and only if @(X) is the edge set of a cycle in H. With this definition, saying 
that G has an abstract dual is the same as saying that the bond matroid of G is 
graphic; the bijection ¢ establishes an isomorphism between M*(G) and M(H). 


n.n 


8.2.44. Theorem. (Whitney [1933a]) A graph G is planar if and only if its bond 
matroid M*(G) is graphic. 
Proof: We first prove that existence of an abstract dual is preserved under 


deletion and contraction of edges. Suppose that G has an abstract dual H, so 
that M(H) = M*(G). Let e’ be the edge of H corresponding to e under the 
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bijection. To prove that H - e’ is an abstract dual of G — e and that H — e’ is an 
abstract dual of G - e, we use Corollary 8.2.42 to compute 


M*(G —e) = M*(G)-e = M(H)-e'= M(H -e’), and 
M*(G-e) = M*(G) ~e= M(H) —e' = M(H —e’). 


We have demonstrated that planar graphs have abstract duals. By Kura- 
towski’s Theorem, a nonplanar graph contains a subdivision Ks or K3.3. Hence 
Ks or K3.3 is a minor of it. Since existence of abstract duals is preserved un- 
der deletion and contraction, showing that Ks and K33 have no abstract dual 
implies that every nonplanar graph has no abstract dual. 

If H is an abstract dual of G, then also G is an abstract dual of H, since 
M*(G) = M(A) if and only if M(G) = M*(#A). If G has girth g, then bonds of 
H have size at least g, so 6(H) > g. Also e(H) = e(G), and the degree-sum 
formula yields n(H) < |2e(H)/5(H)| < |2e(G)/g|. 

Let H be an abstract dual of K;. Since Ks has girth 3, n(H) < |20/3| = 6. 
Since all bonds of Ks have four or six edges, all cycles of H have four or six 
edges, and thus H is a simple bipartite graph. However, no simple bipartite 
graph with at most six vertices has ten edges. 

Let H be an abstract dual of K3.3. Since K3.3 has girth 4,n(H) < [18/4] =4 
Since all bonds of K3.3 have at least three edges, all cycles of H have at least 
three edges, and thus H is a simple graph. However, no simple graph with at 
most four vertices has nine edges. | 
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The argument that bond matroident plane graphs are graphic shows that 
every “geometric” dual of a planar graph i is an abstract dual. We have seen 
that the geometric dual need not be unique. Nevertheless, the cycle matroid of 
every graph dual to G must be M*(G); hence all geometric duals of G have the 
same cycle matroid. Whitney [1933b] determined when graphs have the same 
cycle matroid (see Exercise 45, also Kelmans [1980, 1987, 1988]). 

Minors have many applications. They will soon help us prove the Matroid 
Intersection Theorem. They are used in characterizing classes of matroids by 
forbidden substructures; for example, a matroid is binary if and only if it does 
not have U24 as a minor. Minors also are used to produce a winning strategy 
for a matroid generalization of Bridg-it (Theorem 2.1.17). 


8.2.45.* Definition. Givene ¢€ E and amatroid M on E, the Shannon Switch- 
ing Game (M, ec) is played by the Spanner and the Cutter. The Cutter 
deletes elements of E — e and the Spanner seizes them, one per move. The 
Spanner aims to seize a set that spans e, and the Cutter aims to prevent 
this. The Cutter moves first. 


Having the Spanner move first can be simulated by adding an element e’ 
such that {e, e’} is a circuit; the Cutter must begin by deleting e’ to avoid losing 
immediately. Bridg-it occurs by letting M be the cycle matroid of the graph 
in Theorem 2.1.17 with e the “auxiliary edge” and e’ an extra auxiliary edge. 
The spanning tree strategy for the Spanner results from the following sufficient 
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condition for a winning strategy. The condition is also necessary, but proving 
that requires the Matroid Union Theorem (Theorem 8.2.55). 


§.2.46.* Theorem. (Lehman [1964]) In the Shannon Switching Game (M, e), 
the Spanner has a winning strategy if there are disjoint subsets X1, X2 of 
E — e such that e € 0(X1) =a (Xo). 


Proof: We use X1, Xo to produce a winning strategy. Let X = o(X1) = 0(X2). 
Since the Spanner can ignore deletions outside X and play in M|(X +e), we may 
assume that X,, X2 are disjoint bases. If the Cutter plays g and the Spanner 
plays f, then g is no longer available and f cannot be deleted; the effect is 
deletion and contraction. Letting M’ = (M — g)- f, we have e € oy (X) if and 
only if g ¢ X ande € oy(X + f). The Spanner wins if e is a loop in M’, which is 
equivalent to e € oy(F), where F is the set seized by the Spanner. 

If |E| = 1, then e is a loop and the Spanner wins; we proceed by induction 
on |E|. It suffices to provide an immediate answer f to g so that M’ = (M—8)-f 
has two disjoint bases. If the Cutter deletes g not in X; or X2, then the Spanner 
seizes an arbitrary f, and the two sets X; — g — f and Xo — g— f are disjoint 
and spanning in M’. Hence we may assume that g € X,. The base exchange 
property yields f € X_ such that X’ = X;-—g+/f € B. Now X’ — f and X_- f 
are disjoint bases avoiding e in the game (M’, e). | 


MATROID INTERSECTION 


im.me/math_b 


Matroid theory took a great leap forward with the proof of the Matroid 
Intersection and Union Theorems by Edmonds. This provided a unified context 
for many well-known min-max relations, which became corollaries. We have 
proved some of these in earlier chapters. Yielding a simple unified proof for 
many important theorems, the Matroid Intersection Theorem can be considered 
among the most beautiful theorems of combinatorics. 

The Matroid Intersection Theorem is-a min-max relation for common in- 
dependent sets in two matroids on the same ground set. We can view the 
intersection of two matroids as a hereditary system, but not as a matroid. For 
multiple matroids on a set E, we typically use subscripts to distinguish corre- 
sponding aspects, as in B; for the bases of M;, etc. We still use X to denote the 
complement of X within the ground set E. 


8.2.47. Definition. Given hereditary systems M;, Mz on E, the intersec- 
tion of M, and Mz is the hereditary system whose independent sets are 
{X CE: X €1, NI}. 


For example, the intersection of the two natural partition matroids on the 
edges of a bipartite graph G has as its independent sets the matchings of G. 
These are generally not the independent sets of a matroid (see Exercises 1—2), 
and thus the greedy algorithm does not solve maximum-weighted matching. 

Recall that a loop is an element forming a nonempty set of rank 0. 
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8.2.48. Theorem. (Matroid Intersection Theorem, Edmonds [1970]) For ma- 
troids M,, M2 on E, the size of a largest common independent set satisfies 


max({|Z|: 7 €1y V1} = minyeg{ry(X) + re(X)}. 


Proof: (Seymour [1976]) For weak duality, consider arbitrary 7 € I; N Ip and 
X CE. The sets 11 X andIN X are also common independent sets, and |/| = 
POX] + |Z X| < ri(X) + r2(X). 

To achieve equality, we use induction on | E|; when |E| = 0 both sides are 0. 
If every element of E is a loop in M, or in Me, then max || = 0 = r1(X) +1re(X), 
where X consists of all loops in M,. Hence we may assume that |E| > 0 and 
that some e € E is a non-loop in both matroids. Let F = E — e, and consider 
the matroids M,|F, Mo|F, M1.F, and Mo.F. 

Let k = minyez{ri(X) + 1r2(X)}; we seek a common independent k-set in M, 
and Mbp. If there is none, then M,|F and M2|F have no common independent k- 
set, and M,.F and M2.F have no common independent k — 1-set. The induction 
hypothesis and rank formulas (Theorem 8.2.43) yield 


ry(X) +re(F — X) < k-1 for some X C F, and 
ri(Y +e) -—l+re(F —-Y+e)-1.< k-2 for some Y C F. 


r(X) +ro(X +e) +ri(¥ +e) +re(¥) < 2k-1. 


_Now we apply submodularity of r; to X and Y +e and submodularity of ro 
to Y and X +e. For clarity, write U =-X +e and V = Y +e. Applying this to 
the preceding inequality yields 


r(X UV) +r(X OV) +re(¥ UU) +re(¥ NU) < 2k-1. 


Since YN U = XUV and YUU = XNV/, the left side sums two instances 
of r1(Z) + re(Z), and the hypothesis k < ri(Z) + re(Z) for all Z C E yields 
2k < 2k — 1. Hence M, and M2 do have a common independent k-set. | 
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It can be helpful to restrict the range of the minimization. 


8.2.49. Corollary. The maximum size of a common independent set in ma- 
troids M;, M2 on E is the minimum of r;(X;) + re(X_g) over sets X1, X2 such 
that X; U Xo = E and each X; is closed in M;. 


Proof: The incorporation property implies that r;(o;(X)) = r;(X). a 
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We have proved special cases of the Matroid Intersection Theorem by other 
means. We proved the Kénig—Egervary Theorem in various ways, and we 
proved the Ford—Fulkerson characterization of CSDRs from Menger’s Theorem 
in Theorem 4.2.25. Whenever we have two matroids on the same set, the Ma- 
troid Intersection Theorem tells us that there must be a min-max relation for 
the maximum size of a common independent set, tells us what the result should 
be, and prevides a proof. 

8.2.50. Corollary. (Konig [1931], Egervary [1931]) In a bipartite graph, the 
largest matching and smallest vertex cover have equal size. 


Proof: When M; and M2 are the partition matroids on E(G) induced by the 
partite sets U;, U2 of G, the matchings are the common independent sets. For 
X1, X2 C E, the rank r;(X;) counts the vertices of U; incident to edges in X;. 
Hence if X; U Xo = E, then G has a vertex cover of size r1(X1) + re(X2), using 
vertices of U; to cover X;. Conversely, if T; U Tz is a vertex cover with T; € U;, 
let X; be the set of edges incident to 7;; we have X; U X2 = E with X; closed in 
M; and r1(X1) + re(X2) = |T1|.+ |T2|. We conclude that 


o’(G) = max{|J|: J € 1; Ng) = min{ry(X1) + re(Xe)} = B(G). Z 


The next corollary uses the rank fu nction for transversal matroids. 
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8.2.51. Example. Transversal matroids (see Example 8.2.13). Suppose that 
A,U.---UA,, = E, and let G be the corresponding incidence graph with partite 
sets E and [m]. Consider X CE. If INCY )| < |¥| for some Y C X, then Y forces 
at least |Y| — |N(Y)| unsaturated elements i in X. Hall’s Condition applied to X 
yields r(X) = min{|X| — (|¥| — |N(Y)|): Y C X} (Exercise 51). 

We obtain another expression for r(X) (see Ore [1955]). Let A(J) = 
U;<,A;; in terms of the graph, A(J) = N(J). By applying Hall’s Condition 
to [m] instead of E, we can write the maximum size of a matching as r(M) = 
min{m — (|J| — |A(J)|): J ¢ [m]}. To determine the maximum number of el- 
ements in X C E that can be matched, we discard the élements of E — X, 
obtaining r(X) = minycfmj{|A(J) N X| — |J| +m). 


N(Y) J 
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The first formula for r(X) uses neighborhoods of subsets of E; the second 
uses neighborhoods of subsets of [m]. Exercise 53 shows directly that the second 
rank formula is the rank function of a matroid, without relying on results from 
bipartite matching. Further material on transversals appears in Mirsky [1971] 
and in Lovasz—Plummer [1986]. | 
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8.2.52. Corollary. (Ford—Fulkerson [1958]) Families A = {A),..., A,,} and B = 
{B,,..., B,} have a common system of distinct representatives (CSDR) if 
and only if, for each /, J € [ml], 


(4008) = + l=. 


iel jel 


Proof: A common partial SDR is a common independent set in the two transver- 
sal matroids M,, M2 induced on E by A and B. To determine when there is a 
complete CSDR, we need only restate the condition r;(X) + ro(X) > m to find 
the appropriate condition on the set systems. 

The rank formulas from Example 8.2.51 yield 


ry(X) + 1re(X) = min{|A(/) A X|— || +m}t+ min {|B(J) NX] -|J| +m}. 


Hence r3(X) + r2(X) > m for all X if and only if 
JA) AX| + |B) X| = [Z| +|J| — m for all X C Evand I, J C [ml]. 


Given /, J, consider the contribution of an element of E to the left side. 
Kach element of A(7)M B(J) counts once whether it belongs to X or X. Elements 
of A(/) — B(J) count if and only if they belong to X, and elements of B(J) — A(/) 
count if and only if they belong to X. Hence the left side is minimized for I, J 
when A(/) — B(J) C X and B(J) — A(Z) C X. In this case the left side equals 
|A(1) N B(J)|, which yields the Ford—-Fulkerson condition. | 


The augmenting path approach to maximum bipartite matching general- 
izes to matroid intersection. The algorithm yields a common independent set / 
of maximum size and a set X such that r1(X) + ro(X) = |/7| (see Lawler [1976], 
Edmonds [1979], Faigle [1987]). Finding a maximum common independent set 
in three matroids is NP-complete (??s —). 


MATROID UNION 


The intersection of two matroids is seldom a matroid, but a natural concept 
of matroid union does always yield a matroid. Together with a useful min-max 
relation for the rank function, this is the content of the Matroid Union Theo- 
rem. The Matroid Intersection and Union Theorems are equivalent; they can 
be derived from each other. Welsh [1976] proves the Matroid Union Theorem 
first; here we obtain it from the Matroid Intersection Theorem. 


8.2.53. Definition. The union M; U.--U M, of hereditary systems M;,..., M; 
on E is the hereditary system M on E defined by Iy = {, U---Uk: ], € 
I;}. The direct sum M, @--- @ M, of hereditary systems M,,..., M, on 
disjoint sets £,,..., E, is the hereditary system M on E£, U-:-U E; defined 
by Iy ={,U---UA_:]; € I;}. 
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The direct sum M,@---@M;, on £),..., E, can be expressed as the union of 
M,,..-,M, on E’ = E,U---UE, by letting M; be a copy of M; with the additional 
elements of E’ — E; added as loops. When each M; is a uniform matroid, the 
direct sum is a generalized partition matroid. Here £;,..., E, partition E, 
there are positive integers r;,...,r,, and X € Lif |X N E;| <.r;. The partition 
matroids defined earlier arise when all r; = 1. 


8.2.54. Proposition. Given matroids M;,..., M;, on disjoint sets £1,..., Ex, 
the direct sum M = M, @--- @ M, is a matroid. 


Proof: Since the £;,..., E, are pairwise disjoint, the intersection of any J «I 
with each E; is independent in M;. If 11, Jz € I with |Je| > |/1|, then |/2 N E;| > 
[11 1 E;| for some i. Since both sets are independent in M;, we can augment 
I, E; from Ig N E; and therefore J; from Jz. Hence M; ®--- @ M, satisfies the 
augmentation property. a 


Using a direct sum, we prove that the union of matroids is always a ma- 
troid, and we compute the rank function. 


8.2.55. Theorem. (Matroid Union Theorem—Edmonds—Fulkerson [1965], 
Nash-Williams [1966]) If M,,...,.M, are matroids on E with rank func- 
tions r1,...,7,, then the union M = M; U---U M, is a matroid with rank 
function r(X) = minycx (|X — Y|+5¢7;(Y)). 


Proof: (following Schrijver [to appear]). After proving the formula for the rank 
function, we will verify the submodularity property to prove that M is a ma- 
troid. First we reduce the computation of the rank function to the computation 
of r(E). In the restriction of the hereditary system M to the set X, we have 
Imix = {Y Cx:Ye In} and ru\x(Y) = ru(Y) for Y = XxX. Thus M|X = U;(M; |X), 
and applying the formula for the rank of the full union to M|X yields ry(X). 
Consider a k by | E| grid of elements E’ in which the jth column £; consists 
of k copies of the element e; €¢ E. We define two matroids N;, No on E’ such 
that the maximum size of a set independent in both N; and Ne equals the 
maximum size of a set independent in M. We then compute ry(£) by applying 
the Matroid Intersection Theorem to N; and N2. Let M/ be a copy of M; defined 
on the elements E' of rowi in E’. Let N; be the direct sum matroid M|®---®M;, 
and let Ne be the partition matroid induced on E’ by the column partition {E;}. 
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Each set X € Iy has a decomposition as a disjoint union of subsets X; € I,, 
because I; is a hereditary family. Given a decomposition {X;} of X € Iy, let 
X; be the copy of X; in E'. Since {X;} are disjoint, UX; is independent in No, 
and xX; <€ I; implies that UX; is also independent in N,. From X é€ Iy, we 
have constructed UX; of size |X| in Iy, NIy,. Conversely, any X’ €-Iy, NIn, 
corresponds to a decomposition of a set in Iy of size |X ‘| when the sets X’/N E' 
are transferred back to E, because Ne forbids multiple copies of elements. 

Hence r(£) = max{|/|: J € Iy, NIy,}. To compute this, let the rank func- 
tions of N;, No be qi, gz, and let r; be the rank function of the copy Mj of M; 
on E'. We have q;(X’) = > 7r/(X'N E'), and q2(X’) is the number of elements 
of E that have copies in X’. The Matroid Intersection Theorem yields r(E) = 
min y’ce{qi(X’) + qo(E’ — X’)}. 

By Corollary 8.2.49, the minimum is achieved by a set X’ such that E’ — X’ 
is closed in Ng. The closed sets in the partition matroid Nz are the sets that 
contain all or none of the copies of each element—the unions of full columns of 
E’. Given X’ with E’ — X’ closed in No, let Y C E be the set of elements whose 
copies comprise X’. Then qo(E’ — X') = |E—Y|, and X’ contains all copies 
of the elements of Y, so qi(X’) = Dorj(X'N E') = ¥or;(Y). We conclude that 
r(E) =minyce{|E — Y|+ ori(Y)}. 

To show that M is a matroid, we verify submodularity for r. Given X, Y C 
E, the formula for r yields U C X and V C ¥Y such that 

r(X)=|X-Ul4+or(U); = r(Y)=I¥ —VI+ o7,(V). 


~ 


Since UNV © XNY and UUV C X UY, we also have 
r(XNY) <|(XNY)-WUaAV)I+ Don(UNV); 


r(X UY) <|(XUY)—(U UV) + Sori(U UV). 


After applying the submodularity of each r; and the diagram below, these in- 
equalities yield r(X NY) +r(X UY) <.r(X)+7r(Y). w 


) 


(XN Y)-(UNV) + [(XUY)-(UUV)| =|¥ —Ul+1¥ —-V| 


In applying the Matroid Intersection Theorem, we needed N, to be a ma- 
troid, which required {M;} to be matroids. Hence this rank formula does not 
apply for unions of arbitrary hereditary systems. 

The Matroid Union Theorem yields short proofs of min-max relations for 
packing and covering problems. In each formula below, the optimal subset is 
closed, since switching from X to o(X) improves the numerator without chang- 
ing the denominator. The graph corollaries originally had difficult ad hoc proofs. 
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8.2.56. Corollary. (Matroid Covering Theorem—Edmonds [1965b]) In a loop- 
less matroid M on E, the minimum number of independent sets whose 
|X| 


union is E is maxyce Ea 


Proof: Let M;,..., M; be copies of M on E. The set E is the union of k inde- 
pendent sets in M if and only if E is independent in M’ = M,U.---UM,. By the 
Matroid Union Theorem, r’(E) > |E| is equivalent to |E| — |Y|+ dor,(Y) = |E| 
for all Y Cc E. Since r;(Y) = r(Y) for all i, we conclude that E is the union of k 
independent sets if and only if kr(Y) > |Y| for all Y C E. a 


8.2.57. Corollary. (Nash-Williaims [1964]) The minimum number of forests 


needed to cover the edges of a graph G (its arboricity) is maxycg | sees |. 


Proof: (Edmonds [1965b]) This follows immediately by applying Corollary 
8.2.56 to M(G). The best lower bound arises from a connected induced sub- 
graph H (corresponding to a closed set in M(G)). | 


8.2.58. Corollary. (Matroid Packing Theorem—Edmonds [1965c]) Given a ma- 
troid M on E, the maximum number of pairwise disjoint bases equals 


minx: r(X)<r(E) | eae, |. 
Proof: The set E contains k disjoint bases if and only if r‘(E) > kr(E) in the 
union M’ of k matroids M;,..., M, that are copies of M on E. By the Matroid 
Union Theorem, this requires |E| — |Y p+ > ri(Y) => kr(E£) for all Y C E. Since 
ri(Y) = r(Y) for all i, we conclude that k disjoint bases exist if and only if 
|E| -|¥| = k(r(e) —r(Y)) for all Y CE w 


rks 


8.2.59. Corollary. (Nash-Williams [1961], Tutte [1961a]) A graph G has k pair- 
wise edge-disjoint spanning trees if and only if, for every vertex partition 
P, there are at least k(|P| — 1) edges with endpoints in different sets of P. 


Proof: (Edmonds [1965c]) We may assume that G is connected. By applying 
Corollary 8.2.58 to M(G), we must determine when |E| — |X| > k(r(E) — r(X)) 
for each closed set X. The closed sets correspond to partitions of V(G) into 
vertex sets inducing connected subgraphs. For each such partition Vj,..., Vp, 
the corresponding closed set X is |) E(G[V,]) with rank n — p. Since |E| — |X| 
counts the edges between sets of the partition and r(E) — r(X) > p — 1, the 
graph has k disjoint spanning trees if and only if the condition holds. | 


EXERCISES 


8.2.1. (—) Show that the stable sets of a graph need not be the independent sets of 
a matroid by finding vertex-weighted graphs where the tatio between the maximum 
weight of a stable set and the weight of a stable set found greedily is arbitrarily large. 


8.2.2. (—) Characterize the graphs whose stable sets form the family of independent 
sets of a matroid on the set of vertices. 
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8.2.3. (—) Show that every partition matroid is a transversal matroid. 


8.2.4. Modify the greedy algorithm to obtain (with proof) an algorithm for finding the 
maximum-weighted independent set in a matroid with arbitrary real weights (not nec- 
essarily nonnegative) on the elements. 


8.2.5. Characterize the graphs whose matchings form the family of independent sets of 
a matroid on the set of edges. 


8.2.6. (!) Determine which uniform matroids are graphic. Characterize the graphs 
whose cycle matroids are uniform matroids. 


8.2.7. (!) Determine which partition matroids are graphic. Characterize the graphs 
whose cycle matroids are partition matroids. 


8.2.8. Using only linear dependence, prove that vectorial matroids satisfy the induced 
circuit property: adding an element to a linearly independent set of vectors creates at 
most one minimal dependent set. 


8.2.9. Describe the circuits of a transversal matroid M in terms of the corresponding 
bipartite graph G. Using only properties of bipartite graphs, prove that M satisfies the 
weak elimination property. 


8.2.10. Let M(G) be the cycle matroid of G. Let k(X) be the number of components of 
the spanning subgraph Gy with edge set X; sor(X) =n—k(X). Let U and V be the sets 
of components in Gy and Gy, respectively. ‘Let H be the U, V-bigraph with u < v when 
the components corresponding to u and v intersect. 

a) Count the vertices and components of H in terms of the numbers k(X), k(Y), and 
k(X OY). Prove that k(X UY) > e(A). 

b) Use part (a) to prove the submodularity property for M(G) without using other 
properties of matroids. (Aigner [1979]) = = 


me/1 


elegra 


~ 


8.2.11. Use the Kériig—Egervary Theorem to prove directly that the rank function of a 
transversal matroid is submodular. 


8.2.12. Let D be a digraph with distinguished source s and sink t. Let E = V(D) —{s, f}. 
For X C E, letr(X) be the number of edges from sUX to X Ut. Prove that r is submodular. 


8.2.13. (—) For an element x in a hereditary system, prove that the following properties 
are equivalent and characterize loops. 


a) r(x) = 0. d) x belongs to no Base. 
b) x €a0(@). e) Every set containing x. is dependent. 
c) x is a circuit. f) x belongs to the span of every X C E. 


8.2.14. (—) Prove equivalence of the following characterizations of parallel elements, 
assuming that x 4 y and neither is a loop. 

a)r(x,y)=1. 

b) {x, y} EC. 

c)x €a(y), y € a(x), r(x) =r(y) = 1. 
Furthermore, show that if x, y are parallel and x € ao (X), then y € o(X). 


8.2.15. (—) Suppose that r(X) = r(X NY) for somé X, Y C E in a matroid on E. Prove 
that r(X UY) =r(Y). Does the converse hold? 


' 8.2.16. Let M be a hereditary system with nonnegative weights on E. Prove directly 
that if M satisfies the base exchange property (B), then the greedy algorithm always 
yenerates a maximum-weighted base. 
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8.2.17. Alternative matroid axiomatics. Let M be a hereditary system. Prove the fol- 
lowing implications directly for M. 

a) (—) Submodularity (R) implies weak absorption (A). 

b) Strong absorption (A’) implies submodularity (R) (without using uniformity). 
(Hint: Use induction on |XaY|.) 

c) Base exchange (B) implies uniqueness of induced circuits (J). 

d) (--) Uniqueness of induced circuits (J) implies weak elimination (C). 

e) Uniqueness of induced circuits (J) implies augmentation (I). (Hint: Use J and 
induction on |/, — /2| to obtain the augmentation.) 


8.2.18. Prove that a hereditary system is a matroid if and only if it satisfies the “ultra- 
weak” augmentation property: If 1,, lz € I with |/2| > |/;| and | — I2| = 1, then I, +e € I 
for some e € Ig — 1h. (Chappell [1994al) 


8.2.19. (—) Let M be a matroid on E£, and fix A C E. Obtain I’ from I by deleting the 
sets that intersect A. Prove that I’ is the family of independent sets of a matroid on E. 


8.2.20. For a matroid on E with e ¢ B € B, let C(e, B) be the unique circuit in B +e. 
a) For e ¢ B, prove that B — f +e is a base if and only if f belongs to C(e, B). 
b) For e € C € C, prove that C = C(e, B) for some base B. 


8.2.21. (—) Let B,, By be bases of a matroid such that |B,sBo| = 2. Prove that there is 
a unique circuit C such that BjaBg CC C By U Bo. 


8.2.22. (—) Let B,, Be be bases in a matroid M. Given X; C B,, prove that there exists 
X2 C Bz such that (By — X;)U X2 and (By — X2) UX, are both bases of M. (Greene [1973]) 


8.2.23. (!) Let By, Bo be distinct bases of a matroid M. 

a) Let G be a Bj), Bo-bigraph with e € By adjacent to f € Bz when B,+e— f €B. 
Prove that G has a perfect matching. 5 

b) Conclude from part (a) that there exists a bijection 2: B,; — Bz such that for each 
e € Bi, the set Bo — m(e) +e is a base of M. 


8.2.24. (!) Let B,, Bo be distinct bases of a matroid M. 
a) Prove that for each e € B,, thereis f € Bz such that B, —e+ f and B.— f +e are 
bases. (Hint: Use the incorporation property. Note: This generalizes Exercise 2.1.34.) 
b) Use the cycle matroid M(K,) to show that there may be no bijection 7: B, > Be 
such that e and f = x(e) satisfy part (a) for all e € By. 


8.2.25. (—) A collection of |E| —r(£) circuits of a matroid on E form a fundamental set 
of circuits if it is possible to order the elements e;,..., e, in such a way that C; contains 
€+(£)+; but no higher-indexed element. Prove that every matroid has a fundamental set 
of circuits. (Whitney [1935]) 


8.2.26. (—) Given k distinct circuits {C;} with none contained in the union of the others, 
and given a set X with |X| < k, prove that Li C; — X contains a circuit. (Welsh [1976]) 


8.2.27. (+) For a hereditary system, prove directly that the weak elimination property 
implies the strong elimination property, using induction on |C; U C2|. (Lehman [1964]) 


8.2.28. (!) Min-max relation for weighted independent set. Let M be a matroid on E, 
with each e € E having nonnegative integer weight w(e). Let A be the set of chains 
X, © X2 © --- such that each e € E appears in at least w(e) sets in the chain (sets may 
repeat in the chain). Use the greedy algorithm to prove that 


max) <¢] ee w(e) = MiNy,j)cA a r(X;). 
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8.2.29. (—) Let r and o be the rank function and span function of a matroid. Prove that 
r(X) = min{|Y|: Y C X,a(Y) = a(X)}. 


8.2.30. Prove that a matroid of rank r has at least 2’ closed sets. (Lazarson [1957]) 


8.2.31. Prove that a matroid is simple if and only if 1) no element appears in every hy- 
perplane, and 2) from every distinct pair of elements some hyperplane contains exactly 
one. Prove that these conditions also suffice for a family of sets to be the collection of 
hyperplanes of a simple matroid. 


8.2.32. Prove that in a matroid, a set is a hypobase if and only if it is a hyperplane. 


8.2.33. Use the weak elimination property to characterize when a family of sets is the 
family.of hyperplanes of some matroid. 


8.2.34. Prove that the closed sets of a matroid are the complements of the unions of 
cocircuits. 


8.2.35. Let.X be a closed set in a matroid M. 

a) Let Y be a closed set contained in X such that r(Y) = r(X) — 1. Prove that M has 
a hyperplane H such that Y = XN H. (Hint: Given a maximal independent subset Z of 
Y, augment it by e € X and then to a base B, and let H = a (B — e).) 

b) Prove that X is the intersection of r(M) — r(X) distinct hyperplanes. 


8.2.36. Prove the following properties of closed sets in a matroid. 

a) The intersection of two closed sets is a closed set. 

b) The span of a set is the intersection of all closed sets containing it. (Comment: 
Hence o(X) is the unique minimal closed set containing X.) 

c) The union of two closed sets need not be a closed set. 


8.2.37. Prove that M.X has no loops if and only if X is closed. 


8.2.38. (!) Bases and cocircuits in matroid® 

a) Prove that when e belongs to a base B in a matroid M, there is exactly one 
cocircuit of M disjoint from B — e, and it contains e. 

b) Use part (a) to prove that if C is.a circuit of a matroid M and x, y are distinct 
elements of C, then there is a cocircuit C* € C* with C* NC = {x, y}. (Minty [1966]) 

c) Explain why part (b) is trivial for cycle matroids. 


8.2.39. (—) Show that the dual of a simple matroid (no loops or parallel elements) need 
not be simple. Determine whether a set can be both a circuit and a cocircuit in a matroid. 


8.2.40. (!) Use matroid duality to prove Euler’s Formula for connected plane graphs. 


8.2.41. Prove that any minor of a matroid obtained by restricting and then contracting 
can also be obtained by contracting and then restricting. In particular, if M is a matroid 
on £ and Y C-X C E, prove that (M|X).Y = (M.X — Y)|Y and (M.X)|Y = (M|X — Y).Y. 


8.2.42. (!) Use duality and matroid restriction to prove that ry. -(X) = ru(XUF)—ry(F). 
Also derive the formula directly by proving that X is independent in M.F if and only if 
adding X to F increases the rank by |X|. 


8.2.43. Prove that the cycle matroid M(G) is the column matroid over Z2 of the vertex- 
edge incidence matrix of G. (Hence every graphic matroid is binary.) 
8.2.44, Tutte [1958] proved that a matroid if and only if it has no U2,4-minor. 

a) Prove that the matrix ci ; a) represents U2, over Z3. 

b) Prove that U2,4 has no representation over Zp. 
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8.2.45. Prove that the three operations below preserve the cycle matroid of G. 

a) Decompose G into its blocks B,,..., B,, and reassemble them to form another 
graph G’ with blocks By, ..., By. 

b) In a block B of G that has a two-vertex cut {x, y}, interchange the neighbors of x 
and y in one of the components of B — {x, y}. 

c) Add or delete isolated vertices. 


- 


(Comment: Whitney’s 2-Isomorphism Theorem [1933b] states that G and H have the 
same cycle matroid if and only if some sequence of these operations turns G into H. 
Thus every 3-connected planar graph has only one dual graph, meaning essentially only 
one planar embedding. See also Kelmans [1980]) 


8.2.46. Construct a graph without isolated vertices that is an abstract dual of the graph 
below but is not a geometric dual of this graph. (Hint: Consider the operations of 
Exercise 8.2.45.) (Woodall, in Welsh [1976], p91-92) 


8.2.47. The matroid basis graph is the graph having a vertex for each base of a ma- 
troid, with bases adjacent when their symmetric difference has size 2. Prove that every 
matroid basis graph has a spanning cycle, and interpret the result for graphic matroids 
and for uniform matroids. (Hint: Use contraction and restriction inductively to establish 
a spanning cycle through any edge.) (Holamann-Harary [1972], Kung [1986, p72]) 


8.2.48. Use weak duality of linear programming to prove the weak duality property for 
matroid intersection: |/| < r}(X) + re(X) for any Te, NI, and X CE. (Hint: Consider 
the discussion of dual pairs of linear programs in Remark 8.1.7.) 


8.2.49. Let M,, M2 be two matroids on E. 

a) Prove that the minimum size of a set in E that is spanning in both M, and Mg is 
maxyce(ri(E) — r1(X) + re(E) — re(X)). 

b) Apply part (a) to prove that in a bipartite graph with no isolated vertices the 
minimum number of edges needed to cover all the vertices equals the maximum number 
of vertices with no edges among them. (K6nig’s “other” theorem) 

c) From part (a), prove that the maximum size of a common independent set plus 
the minimum size of acommon spanning set equals r}(E)+re(E). In particular, conclude 
Gallai’s Theorem for bipartite graphs: in a bipartite graph with no isolated vertices, the 
maximum size of a matching plus the minimum number of edges needed to cover the 
vertices equals the number of vertices. 


8.2.50. Use the Matroid Intersection Theorem to prove that in every acyclic orientation 
of G the vertices can be covered with at most a(G) pairwise-disjoint paths. (Chappell 
[1994b]) (Comment: This is the special case of Theorem 8.4.33 for acyclic digraphs.) 


8.2.51. (—) Let M be the transversal matroid on E = UA; induced by sets Aj,..., Am. 
Use Hall’s Theorem for matchings in bipartite graphs to derive the rank function as 
r(X) = minycx{|X{ — (/¥| — |N(Y)))}. 


8.2.52. Let G be an E, [m]-bigraph without isolated vertices. For X C E, let r(X) = 
min{|N(J) 1 X| —|J| +m: J ¢ [m]}. Prove that the following are equivalent for X. 
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A) Hall’s Condition holds (|N(S)| > |S| for all S C X). 

B) r(X) > |X|. 

C) X is saturated by some matching in G. 
(Hint: The proof of B= C uses paths from unsaturated vertices that alternate between 
edges outside and within a specified matching.) 


8.2.53. (!) Let G be an E, [m]-bigraph without isolated vertices. For X C E and J C [ml, 
let 9(X, J) = |N(J)N X| — |J|, and let r(X) = min{g(X, J) +m: J C [m]}. Say that J is 
X-optimal if r(X) = g(X, J) +m. 

a) Prove that r(@) = 0 and that r(X) < r(X +e) <r(X) +1. 

b) Prove that r satisfies the weak absorption property. 


8.2.54. Prove that restrictions and unions of transversal matroids are transversal ma- 
troids, but that contractions and duals of transversal matroids need not be. 


8.2.55. Gammoids. Let D be a digraph, and let F, E be subsets of V(D). The gammoid 
on E induced by D, F is the hereditary system given by I = {X C E: there exist |X| 
pairwise disjoint paths from F to X}; equivalently, r(X) is the maximum number of 
pairwise disjoint F, X-paths. 

a) Verify that every transversal matroid is a gammoid. 

b) (+) Prove that every gammoid is a matroid. (Hint: Use Menger’s Theorem to 
verify the submodularity property. Verifying the augmentation property is also possible 
but somewhat longer.) (Mason [1972]) 


8.2.56. Strict gammoids. Let D be a directed graph, let F, E be subsets of the vertices 
of D, and let M be the gammoid on E induced by D, F (Exercise 8.2.55). When E con- 
sists of all vertices of D, the gammoid is a strict gammoid. Prove that a matroid is a 
strict gammoid if and only if it is the dual‘of a transversal matroid. (Hint: Use a nat- 
ural correspondence between directed graphs on n vertices and bipartite graphs on 2n 
vertices.) (Ingleton—Piff [1973]) Bb 


8.2.57. (—) Since the union of two matroids is a matroid, there should be a dual opera- 
tion yielding its dual. Given matroids M,, Mz with spanning sets S,, S;, let M; \ M2 be 
the hereditary system whose spanning sets are {X; M X2: X; € Si, X2q € Se}. Prove that 
M, “4 Mz is the matroid (Mj U M3)’. 


8.2.58. Generalized transversal matroids. 

a) Let M be a matroid on E, and let A = {A),..., Am} be a set system on E. Let 
M' be the hereditary system on [m] whose independent sets are the subsets of A having 
transversals that belong to Iy. Prove that M’ is a matroid with rank function r’(X) = 
minycx{|X — Y| + r(A(¥))}. 

b) Let E, F be finite sets, and let f be a function from E to F. For X C E, let f(X) 
be the set of images of elements of X. Let M be a matroid on £. Let M’ be the hereditary 
system on F defined by Iw = {f(X): X € Iy}. Prove that M’ is a matroid. Prove also 
that r’(X) = minycx {|X — Y| +r(f7/(Y))} when f is surjective. 

8.2.59. Apply matroid sum and Exercise 8.2.58 to prove the Matroid Union Theorem. 
8.2.60. (!) Prove that the maximum size of a common independent set in matroids M, 
and M2 on EF’ is rms (E )- rm (E). Use this to prove the Matroid Intersection Theo- 
rem by applying the Matroid Union Theorem to M; U Mj. (Comment: Thus these two 
theorems are equivalent.) 


8.2.61. Let G be an n-vertex weighted graph, and let £),..., E,-; be a partition of E(G) 
into n — 1 sets. Is there a polynomial-time algorithm to compute a spanning tree of 
minimum weight among those that have exactly one edge in each subset E;? 
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8.2.62. (!) Use the characterization of graphs having k pairwise edge-disjoint spanning 
trees (Corollary 8.2.59) to prove that every 2k-edge-connected graph has k pairwise edge- 
disjoint spanning trees. Exhibit for each k a 2k-edge-connected graph that does not have 
k +1 pairwise edge-disjoint spanning trees. (Nash-Williams [1961]) 


8.2.63. Given matroids M,,..., M, on E, the Matroid Partition Problem is the prob- 
lem of deciding whether an input set X C E partitions into sets /,,..., /, with J; « I. 

a) Use the Matroid Union Theorem to show that X is partitionable if and only if 
|X —Y|+ > rj(Y) > |X| for all Y C X, and that all maximal partitionable sets are max- 
imum partitionable sets. 

b) Let M’ be the union of k copies of a matroid M on E, and let X be a maximum 
partitionable set. Prove that there are disjoint sets Fi,..., & © X such that {F;} CI 
and X C o(F\) =-:- =o (F,). 


8.3. Ramsey Theory 


“Ramsey theory” refers to the study of partitions of large structures. Typ- 
ical results state that a special substructure must occur in some class of the 
partition. Motzkin described this by saying that “Complete disorder is impossi- 
ble”. The objects we consider are merely sets and numbers, and the techniques 
are little more than induction. 2 

Ramsey’s Theorem generalizes the pigeonhole principle, which itself con- 
cerns partitions of sets. We study applications of the pigeonhole principle, 
prove Ramsey’s Theorem, and then focus on Ramsey-type questions for graphs. 
Finally, we discuss Sperner’s Lemma about labelings of triangulations; like 
Ramsey’s Theorem, it guarantees a special substructure. 


THE PIGEONHOLE PRINCIPLE REVISITED 


The pigeonhole principle (Lemma A.57) states that if m objects are parti- 
tioned into n classes, then some class has at least [m/n] objects (and some class 
has at most |m/n] objects). This is a discrete version of the statement that 
every set of numbers contains a number at least as large as the average (and 
one at least as small). The concept is simple, but the applications can be quite 
subtle. The difficulty is how to define a partitioning problem relevant to the 
desired application. We illustrate this with four examples. 


8.3.1. Proposition. Among six persons it is possible to find three mutual ac- 
quaintances or three mutual non-acquaintances. 


Proof: (Exercise 1.1.29). In the language of graph theory, we are asked to show 
that for every simple graph G with six vertices, there is a ‘triangle in G or.in 
G. The degrees of vertex x in G and G sum to 5, so the pigeonhole principle 
implies that one of them is at least 3. 
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By symmetry, we may that assume dg(x) > 3. If two neighbors of x are 
adjacent, then they form a triangle in G with x; otherwise, three neighbors of 
x form a triangle in G. | 
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8.3.2. Theorem. (Graham—Entringer—Székely [1994]) If T is a spanning tree 
of the k-dimensional cube Q,, then there is an edge of 0, outside T whose 
addition to T creates a cycle of length at least 2k. 


Proof: For each vertex v of Q;, expressed as a binary k-tuple, there is a com- 
plementary vertex v’ that differs from v in each position. There is a unique 
v, v'-path in T; orient its first edge toward v’. Since n(Q,) = e(T) +1, doing this 
for each vertex orients some edge twice, by the pigeonhole principle. 

Since this edge uv receives an orientation from u and from v, we have v on 
the u, u’-path and u on the v, v’-path in T. Hence the u, v’-path and the v, u’- 
path in T are disjoint. Each has length at least k — 1, since the distance in Q, 
between a vertex and its complement is k. Finally, u — v in Q; implies also 
u' <> v’, which completes a cycle of length at least 2k. a 
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Theorem 8.3.2 implies that every spanning tree of Q, has diameter at least 
2k — 1 (Graham—Harary [1992]). 


8.3.38. Theorem. (Erdés—Szekeres [1935]) Every list of more than n? distinct 
numbers has a monotone sublist of length more than n. 


Proof: Let a = aj, ..., ;241 be the list. Assign position k the label (x;,, y,), where 
x, is the length of a longest increasing sublist ending at a,, and y, is the length of 
a longest decreasing sublist ending at a,. Ifa has no monotone sublist of length 
n-+1, then x, and y, never exceed n, and there are only n? possible labels. 
Since the list has length n? + 1, the pigeonhole principle now implies that 
two labels must be the same. This is impossible when the elements of a are 
distinct. When i < j and a; < a;, we can append aq; to the longest increasing 
sequence ending at a;. When i < j and a; > a;, we can append a; to the longest 
decreasing sequence ending at a;. (See Exercise 5.1.43 for a generalization.) & 


a. 67 4 1 8 5 2 9 6 3 0 
x,y: 1,1 1,2 1,3 2,1 2,2 2,3 3,1 3,2 3,3 4,1: 
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8.3.4. Theorem. (Graham—Kleitman [1973]) In every labeling of E(K,,) using 
distinct integers, there is a trail of length at least n — 1 along which the 
labels strictly increase. 


Proof: We assign each vertex a weight equal to the length of the longest in- 
creasing trail ending there. If we can show that these n weights sum to at least 
n(n — 1), then the pigeonhole principle guarantees a vertex with a large enough 
weight. The problem is how to compute the weights and their sum. 

We grow the graph from the trivial graph by adding the edges in order, 
. updating the weights and their sum at each step. The vertex weights begin 
at 0, If the next edge joins two vertices whose weights were both i, then their 
weights both become i + 1. If it joins two vertices of weights i and j withi < j, 
then their weights become j + 1 and j. 

In either case, each time an edge is added, the sum of the weights of the 
vertices increases by at least 2. Therefore, when the construction is finished, 
the sum of the vertex weights is at least n(n — 1). a 


Finally, we note that the thresholds in the classes may differ. 


8.3.5. Theorem. If >> p; — k + 1 objects are partitioned into k classes with 
quotas {p;}, then some class must meet its quota. 


Proof: If not, then at most 5°(p; — 1) objects can be accommodated. a 


RAMSEY’S THEOREM 


telegram.me/t 


The pigeonhole principle guarantees a class with many objects when we 
partition objects into classes. The famous theorem of Ramsey [1930] makes 
a similar statement about partitioning the r-element subsets of objects into 
classes. Roughly put, Ramsey’s Theorem says that whenever we partition the 
r-sets in a sufficiently large set S into k classes, there is a p-subset of S whose 
r-sets all lie in the same class. 

A partition is a separation of a set into subsets, and the set we want to 
partition consists of subsets of another set, so for clarity we use the language 
of coloring instead of the language of partitioning. Recall that a k-coloring of 
a set is a partition of it into k classes..A class or its label is a color. Typically 
we use [k] as the set of colors, in which case a k-coloring of X can be viewed as 
a function f: X > [k]. 


8.3.6. Definition. Let (°) denote the set of r-element subsets (r-sets) of a set 
S. Aset T C S is homogeneous under a coloring of (°) if all r-sets in T 
receive the same color; it is i-homogeneous if that color is i. 

Let r and p;,..., py be positive integers. If there is an integer N such 
that every k-coloring of ('*') yields an i-homogeneous set of size p; for some 
i, then the smallest such integer is the Ramsey number R(pj,..., 2:57). 
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Ramsey’s Theorem states that such an integer exists for every choice of r 
and pi,..., px (the latter are called thresholds or quotas). When the quotas 
all equal p, the theorem states that every k-coloring of the r-sets of a sufficiently 
large set has a p-set whose r-sets receive the same color. A thorough study 
of Ramsey’s Theorem and other partitioning theorems appears in Graham— 
Rothschild—Spencer [1980, 1990]. 

Before proving the theorem, we consider the case r = k = 2, which is easy 
to describe in terms of edge-coloring of graphs. The proof for this case has the 
same structure as for the general case. 

When r = 2, a k-partition of (°) is merely a k-edge-coloring of the complete 
graph with vertex set S (not a proper edge-coloring). When k = 2, the time- 
honored tradition in Ramsey theory is that color 1 is “red” and color 2 is “blue”. 

By Proposition 8.3.1, R(3, 3; 2) < 6; we extend the argument to prove that 


R( py, po: 2) < R(pi — 1, po: 2) + R(p1, pe — 1: 2). 


Assuming that R(p; — 1, po: 2) and R(pi, po — 1; 2) exist, let N be their 
sum. Proving the bound for R(pi, po: 2) means showing that every red/blue- 
coloring of the edges of a complete graph with N vertices yields a p;-set of 
vertices within which all edges are red or a po-set of vertices within which all 
edges are blue. 

Consider a red/blue-coloring of K é and choose a vertex x. Let s = R(p; — 

1, po; 2) and t = R(py, pe — 1; 2); there are s +t — 1 vertices other than x. 
Theorem 8.3.5 amplies that x has at least s incident red edges or at least t 
incident blue edges. E 

By symmetry, we may assume that x has at least N incident red edges. By 
the definition of s, the complete subgraph induced by the neighbors of x along 
these edges has a blue po-clique or a red p; — 1-clique. The latter would combine 
with x to form a red p;-clique. In either case, we obtain an i-homogeneous set of 
size p; for some i. We postpone discussion of the resulting bound on R(pj, po; 2). 


p00k 


[S| > R(p1, pe — 1; 2) or |T|.> R(p1 —1, pe; 2) 


8.3.7. Theorem. (Ramsey [1930]) Given positive integers r and pj,..., Dx, 
there exists an integer N such that every k-coloring of (' ™) yields an i- 
homogeneous set of size p; for some i. 


Proof: The proof is a “double” induction. We use induction on r, but the proof 
of the induction step itself uses induction on > p;. 

Basis step: r = 1. By Theorem 8.3.5, R(p1,..., px; 1) exists. 

Induction step: r > 1. We assume that the claim in the theorem statement 
holds for k-colorings of the r — 1-subsets of a set, no matter what the thresholds 
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are. We prove the same statement for k-colorings of the r-subsets of a set by 
induction on the sum of the quotas, )> p;. 

Basis step: some quota p; is less than r. In this case, a set of p; objects con- 
tains no r-sets, so vacuously its r-sets all have colori. Hence R(pj,..., px; r) = 
min{pi,..., px} when min{pi,..., py} <r. 

For clarity, we state the induction step only for k = 2; the argument for 
general k is similar (Exercise 17). Write (p,q) for (pi, pz). Let 


p’ = R(p--1,4@;1r), q’ = R(p,q-1:7r), N=1+R(p',qr—D. 


By the induction hypothesis of the inner induction, p’ and q’ exist. By the 
induction hypothesis of the outer induction, N also exists. Note that p’ and q’ 
may be very large; this is why we need the double induction. 

Let S be a set of N elements, and choose x € S. Consider a 2-coloring f 
of (°). With colors (red,blue), we need to show that f has a red-homogeneous 
p-set or a blue-homogeneous q-set. 

We use f to induce a 2-coloring f’ of the r — 1-sets of S’ = S —x. This is 
the reason for our choice of |S’| as a Ramsey number for r — 1-sets. Define f’ 
by assigning color i to an (r — 1)-set in S’ if its union with x has color i under 
f. Since |S’| = R(p’, q’; r — 1), the induction hypothesis implies that some color 
meets its quota (p’ or q’) under f’ (when r = 2, this step was the invocation of 
the pigeonhole principle). By symmetry, we may assume that the red quota is 
met. Let T be a p’-element subset of S= whose r — 1-sets are red under f’. 

We return to the original coloring f on the r-sets in T. Since |T| = 
p’ = R(p — 1,q;r), under f there is a red-homogeneous p — 1-set or a blue- 
homogeneous q-set in T. If there is a blue-homogeneous q-set, then we are done. 
If there is a red-homogeneous p — 1-set-P, then consider P U {x}. From the defi- 
nition of T, the (r — 1)-sets of P are all red under f’, which means their unions 
with x are red under f. Hence P U {x} is a red-homogeneous p-set under f. @ 


T 
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Like the pigeonhole principle, Ramsey’s Theorem has subtle and fascinat- 
ing applications. Ramsey’s Theorem typically gives an elegant existence proof 
but a horribly large bound. 


8.3.8. Theorem. (Erdés—Szekeres [1935]) Given an integer m, there exists a 
(least). integer N(m) such that every set of at least N(m) points in the plane 
with no three collinear contains an m-subset forming a conyex m-gon. 


Proof: We need two facts. (1) Among five points in the plane, four determine 
a convex quadrilateral (if no three are collinear). Construct the convex hull of 
the five points. If it is a pentagon or a quadrilateral, then the result follows 
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immediately. If it is a triangle, then the other two points lie inside. By the 
pigeonhole principle(!), two of the vertices of the triangle are on one side of the 
line through the two inside points. These two vertices together with the two 
points inside form a convex quadrilateral, as illustrated below. 


QHA © 


In a convex m-gon, any four corners determine a convex quadrilateral. We 
need the converse: (2) If every 4-subset of m points in the plane forms a convex 
quadrilateral, then the m points form a convex m-gon. If the claim fails, then the 
convex hull of the m points consists of t points, for some t < m. The remaining 
points lie inside the t-gon. When we triangulate the t-gon, as illustrated on the 
right above, a point inside lies in one of the triangles. With the vertices of that 
triangle, it forms a 4-set that does not determine a convex quadrilateral. 

To prove the theorem, let N = R(m, 5; 4). Given N points in a plane with 
no three on a line, color each 4-set by convexity: red if it determines a convex 
quadrilateral, blue if it does not. By fact (1), there cannot be five points whose 
4-subsets are all blue. By Ramsey’s Theorem, this means there are m points 
whose 4-subsets are all red: By fact (2), they form a convex m-gon. Hence N(m) 
exists and is at most’ R(m, 5; 4). a 


P 


The bound R(m, 5; 4) is very loose. It is exact for m = 4, where fact (1) 
implies that N (4) = 5 = R(4, 5; 4). In contrast, N(5) = 9 (Exercise 10), but 
R(5, 5; 4) is enormous. Erdés and Szekeres conjectured that N(m) = 2"-2 +1 
and proved that 2"-? < N(m) < (™>) +1. 

Another application concerns search strategies for numbers stored in ta- 
bles. From a set U, a subset of size n is stored in a table of size n according to 
some rule for storing n-sets. Yao [1981] used Ramsey’s Theorem to prove that 
when U is large, the strategy minimizing the worst-case number of probes re- 
quired to test whether some element of U is in the table is to store the chosen 
set in sorted order and test membership by binary search. (For small U, this 
strategy is not best!) The value that Ramsey’s Theorem yields for “large” is 
probably much larger than needed. 


me/mathe 


RAMSEY NUMBERS 


Ramsey’s Theorem defines the Ramsey numbers R(pj,..., px; r). No exact 
formula is known, and few Ramsey numbers have been computed. To prove 
that R(pi,..., Pe; r) = N, we must exhibit a k-coloring of the r-sets among N —1 
points that meets no quota (or show that one exists without constructing it), 
and we must show that every coloring on N points meets some quota. 

In principle, we could use a computer to examine all k-colorings of (a) for 
successive n until we find the first N such that every such coloring meets a 
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quota p; for some i. Even for 2-color Ramsey numbers, 2°) rapidly becomes too 
large to contemplate. Erdés joked that if an alien being threatened to destroy us 
unless we told it the exact value of R(5, 5), then we should set all the computers 
in the world to work on an exhaustive solution. If we were asked for R(6, 6), 
then his advice was to try to destroy the alien. 

When r = 2, we abbreviate the notation R(pi,..., px; r) to R(pi,..., Dx): 
When p = p; = -:: = px, we abbreviate it to R,(p;r). For r > 2, little is known 
other than R(4, 4; 3) = 13 (McKay—Radziszowski [1991]). Even for r = 2, only 
one Ramsey number is known exactly when k > 2, which is R(3, 3, 3) = 17. The 
table below contains the known values of R(p, q) and the best known upper and 
lower bounds for several other values as of July 1999. Several of these bounds 
have improved slightly since the first edition of this book. The current bounds 
are maintained in Radziszowski [1995], which is periodically updated. 


3.é«4 5 6 7 8 9 
6 9 14 18 23 28 36 
18 825 35/41 49/61 55/84 69/115 
43/49 58/87 80/143 95/216 116/316 
102/165 109/298 122/495 153/780 


The computations of R(3, 9) (Grinstead—Roberts [1982]), R(3, 8) (McKay— 
Zhang [1992]), and R(4, 5) (McKay—Radziszowski [1995]) are recent; the oth- 
ers are much older (due primarily to Greenwood—Gleason [1955], Kalbfleisch 
[1967], and Graver—Yackel [1968]). 

We prove only the first two of these results (see Exercise 16 for ae 5)). 
When r = k = 2, we simplify terminology by using two colors called “in” and 

“out”. Ramsey’s Theorer for this case then becomes: “There exists a minimum 
integer R(p, q) such that every graph on R(p, q) vertices has a clique of size p 
or an independent set of size q”. 
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8.3.9. Example. R(3,3) = 6. We showed earlier that R(3, 3) < 6. Since the 
5-cycle has no triangle and no independent 3-set, R(3, 3) > 6. a 


8.3.10. Example. R(3,4) = 9. The graph below has no K3 and no Kg, since 
four independent vertices on an 8-cycle include pairs of opposite vertices on the 
cycle. Hence R(3, 4) > 9. 


Given a vertex x in a graph G, we can add x to two adjacent neighbers to 
form a triangle or add x to an independent 3-set of nonneighbors to form an 
independent 4-set. Since R(2, 4) = 4 and R(3, 3) = 6, we conclude that if x has 
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four neighbors or has six nonneighbors, then G has a triangle or an independent 
4-set. Avoiding both possibilities limits x to at most three neighbors and at most 
five nonneighbors, which yields n(G) < 9. If this occurs for a 9-vertex graph, 
then every vertex has exactly three neighbors. Since the degree-sum formula 
forbids 3-regular graphs of order 9, we obtain R(3, 4) = 9. | 


The proof of Ramsey’s Theorem yields a (very large) recursive upper bound 
on R(p,q;r). Graham—Rothschild—Spencer [1980, 1990] explains how large. 


8.3.11. Theorem. R(p,q) < R(p — 1,4) + R(p,q — 1). If both summands on 
the right are even, then the inequality is strict. 


Proof: If a vertex in an arbitrary graph has R(p — 1, q) neighbors or R(p, q — 1) 
nonneighbors, then the graph has a p-clique or an independent q-set. With 
R(p—1,q)+R(p,q-—1) points altogether in the graph, the pigeonhole principle 
guarantees that one of these possibilities occurs. Equality in the bound requires 
a regular graph with R(p—1, q)+R(p, qg—1)—1 vertices. If both summands are 
even, this requires a regular graph of odd degree on an odd number of vertices, 
which is impossible. a 


Since R(p, 2) = R(2, p) = p, Theorem 8.3.11 yields R(p, q) < an *) (Ex- 
ercise 15). The lack of exact answers has led to study of asymptotics. "For fixed 
q and large p, R(p,q) < cp?! loglog p/ log p (Graver-Yackel [1968], Chung- 
Grinstead [1983]). For gq = 3, the answer is known within a constant factor: 


c'p®/logp < R(p, 3) < cp?/log p. 


The upper bound is due to Ajtai-Komlés—Szemerédi [1980]; the lower to Kim 
[1995]. All these bounds use probabilistic methods (Section 8.5). 

Ramsey numbers for equal quotas are called diagonal Ramsey numbers. 
Asymptotically, the upper bound of Ca . 2) for R(p, p) is c4?/,/p. Exercise 14 
presents a constructive lower bound that i is polynomial in p. The best known 
constructive lower bound grows faster than every polynomial in p but slower 
than every exponential in p (Frankl—Wilson [1981], Exercise 29). 

An exponential lower bound can be proved by counting methods. It yields 


/2 < liminf R(p, p)*/? < limsupR(p, p))/? < 4. 


Determination of this limit (and whether it exists) is the foremost open problem 
about Ramsey numbers. 


8.3.12. Theorem. (Erdés [1947]). R(p, p) > (eV2)~1p2?/2(1 + o(1)). 


Proof: Consider the graphs with vertex set [n]. Each possible p-clique occurs in 
2)-@) of these 2) graphs. Similarly, each p-set occurs as an independent set 
in 2@)-(G) of these graphs. Discarding this amount for each possible p-clique 
and each possible independent p-set leaves a lower bound on the number of 
graphs having no p-clique or independent p-set. 
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Since there are (" ) ways to choose p vertices, the inequality 2(" )2- G) <1 
thus implies R(p, p) > n. Rough approximations yield ("ya (3) < 1 whenever 


n < 2°/2, More careful approximations (using Stirling’s formula to approximate 
the factorials) lead to the result claimed. | 


GRAPH RAMSEY THEORY 


Ramsey’s Theorem for r = 2 says that k-coloring the edges of a large enough 
complete graph forces a monochromatic complete subgraph. A monochromatic 
p-clique contains a monochromatic copy of every p-vertex graph. Perhaps 
monochromatic copies of graphs with fewer edges can be forced by coloring a 
smaller graph than needed to force K,. For example, 2-coloring the edges of 
K3 always yields a monochromatic P3, although six points are needed to force a 
monochromatic triangle. This suggests many Ramsey number questions, some 
easier to answer than the questions for cliques. 


8.3.13. Definition. Given simple graphs G),...,G,, the (graph) Ramsey 
number R(G,..., G,) is the smallest integer n such that every k-coloring 
of E(K,) contains a copy of G; in color i for some i. When G; = G for all i, 
we write R,(G) for R(G,,..., Gx). 
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Burr [1983] determined R(G, G), &lled the “Ramsey number of G”, for all 
113 graphs with at most six edges and2no isolated vertices. Nice formulas are 
known for R(G;, G2) in some cases. Again our two colors are red and blue. 


8.3.14. Theorem. (Chvatal [1977]) If T is an m-vertex tree, then R(T, K,) = 
(m—Din—1) +1. 


Proof: For the lower bound, color Kim—1)m-1) by letting the red graph be 
(n -1)K,-1. With red components of order m — 1, there is no red m-vertex 
tree. The blue edges form an n — 1-partite graph and hence cannot contain K,,. 

The proof of the upper bound uses induction on each parameter, focusing 
on the neighbors of one vertex. Our presentation uses induction on n, invoking 
a property of trees that we proved in Chapter 2 by induction on m. The basis 
step is n = 1; no edges are needed to obtain K,. 

Given a 2-coloring of E(K gm-1)(n-1)+1), consider a vertex x. If x has more 
than (m — 1)(n — 2) neighbors along blue edges. tien the induction hypothesis 
yields a red T or a blue K,,; among thein. This yields a red T or a blue K,, 
(with x) in the full coloring. 

Otherwise, every vertex has at most (m — 1)(n — 2) incident blue edees and 
thus at least m — 1 incident red edges. This yields a red T, because every graph 
with minimum degree at least m — 1 contains T (Proposition 2.1.8). a 
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Whenever the largest component of G has m vertices and x(H) = n, the 
construction in Theorem 8.3.14 yields R(G, H) > (m — 1)(n — 1) +1 (Chvatal- 
Harary [1972]). Burr and Erdés [1983] conjectured that R(G, K, a): — le l(in- 
1) + 1 when m is sufficiently large relative to n(H) and maxrcg a - Although 
this holds (Burr [1981]) when G has many vertices of degree 2 and in some 
other cases, Brandt [2000] showed that for every nonbipartite graph H (such 
as K,) and everyh € R, there is a threshold dp such that R(G, H) > hn(G) for 
almost every.d-regular graph G with d > dp. 

In the upper bound for Theorem 8.3.14, it is crucial that the color classes 
in H are single vertices. When this fails, the lower bound can be very weak. 
When G = H = mks, for example, the Chvatal—Harary result yields R(G, H) > 
(3—1)(8—1)+1=5, but the correct value is 5m. Here the coloring for the lower 
bound is surprisingly asymmetric, considering the symmetry of the inputs. 


8.3.15. Theorem. (Burr—Erdés—Spencer [1975]) R(mK3, mK3) = 5m for m > 3. 


Proof: Let the red graph be K3n_~1 + K1,2m-1, a8 Shown below. Every triangle 
in this graph uses three vertices from the 3m — 1-clique, but the clique does not 
have enough vertices to make m disjoint triangles. The complementary blue 
graph is (Kom—1 + Ki) V K3m_1. Every blue triangle has at least 2 vertices in 
the copy of K2,,-1, so there cannot be m disjoint blue triangles. 
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For the upper bound, we use induction on m. Basis step: m = 2. This 
requires a case analysis that is fairly short if phrased carefully (Exercise 26). 

Induction step: m > 3. Since 5m > R(3,3) = 6, we know that every 2- 
coloring contains a monochromatic triangle. Discarding vertices of triangles 
as we find them, we can continue to find monochromatic triangles while at 
least six vertices remain. Since 5m — 3m > 6 for m > 3, we find m disjoint 
monochromatic triangles. If these all have the same color, then we are done. 

Otherwise, we have at least one triangle in each color. Let abc be a red 
triangle, and let def be a blue triangle disjoint from it. Of the nine edges 
between them, we may assume by symmetry that at least five are red. Some 
pair of these must have a common endpoint in def. 

Now we have a red triangle and a blue triangle with a common vertex; 
together they have five vertices. Since m > 2, the induction hypothesis for the 
coloring on the remaining 5m — 5 vertices yields (m — 1)K3 in one color. We add 
the appropriately colored triangle from the five special vertices. | 


Readers worried about the omission of the basis step in Theorem 8.3.15 
may consider coloring K,,. Avoiding 2K3 forces a bowtie (monochromatic tri- 
angles with a common vertex) as above, but then we find another monochro- 
matic triangle among the remaining six points. This completes a proof that 
R(mK3, mK3) < 5m + 1. Related results appear in Exercises 27-28. 
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We mention one remarkable result. The Ramsey number of an arbitrary 
graph may be exponential in the number of vertices, such as for K,,. Chvatal, 
Rédl, Szemerédi, and Trotter [1983] proved that for the class of graphs with 
maximum degree d, the Ramsey number grows at most linearly in the number 
of vertices! In other words, R(G, G) < cn(G), where c is a constant depending 
only on d. Of course, the constant is a fast-growing function of d, but it does 
not depend on n(G). The proof uses the Szeinerédi Regularity Lemma [1978], 
itself a difficult result with many applications. 


SPERNER’S LEMMA AND BANDWIDTH 


Although Sperner’s Lemma is not generally considered part of Ramsey the- 
ory, we include this material in this section because Sperner’s Lemma has the 
flavor of Ramsey theory: every labeling of a triangulation that satisfies certain 
boundary conditions contains a piece with a special labeling (one element from 
each class). Like Ramsey’s Theorem, Sperner’s Lemma uses very simple ideas 
but has subtle applications; Ramsey’s Theorem relies on the pigeonhole prin- 
ciple and induction, while Sperner’s Lemma uses only a parity argument (and 
induction for a generalization to higher dimensions). 


8.3.16. Definition. A simplicial subdivision of a large triangle T is a parti- 
tion of T into triangular cells such that every intersection of two cells is a 
common edge or corner. We call the corners of cells nodes. A proper la- 
beling of a simplicial subdivison of T assigns labels from {0, 1, 2} to the 
nodes, avoiding label i on the ith edge of T, fori € {0, 1, 2}. A completely 
labeled cell is a cell having all three labels on its corners. 


In a proper labeling, each label appears at one corner of 7, and label i 
avoids the edge of T joining the corners not labeled i. The figure below illus- 
trates a simplicial subdivision and the graph we will obtain from it to prove 
that it has a completely labeled cell. 


8.3.17. Theorem. (Sperner’s Lemma [1928]) Every properly labeled simplicial 
subdivision has a completely labeled cell. 
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Proof: We prove the stronger result that there are an odd number of completely 
labeled cells. We seek such a cell by beginning outside T and entering a cell by 
crossing an edge with labels 0 and 1. If we reach a cell whose third label is 2, 
we are finished. If not, then the third label is 0 or 1, and the cell has another 
0,1-edge. By crossing it, we enter a new cell and can continue looking for a cell 
with the third label. 

This suggests defining a graph G encoding the possible steps. We include a 
vertex for each cell plus one vertex for the outside region. Two vertices of G are 
adjacent if those regions share a boundary edge with endpoints labeled 0 and 
1. The graph on the right above results from the proper labeling on the left. 

A completely labeled cell becomes a vertex of degree 1 in G. A cell with no 0 
or no 1 becomes a vertex of degree 0. The remaining cells have corners labeled 
0, 0, 1 or 0, 1, 1 and become vertices of degree 2. Hence the desired cells become 
vertices of degree 1 in G, and these are the only cells that become vertices of odd 
degree. We have transformed the original problem into the problem of showing 
that G has such a vertex of degree 1. 

The vertex v for the outside region also has odd degree. As we travel from 
the 0-corner to the 1-corner along the edge of T that avoids label 2, we cross an 
edge of G involving v every time we switch from a 0 to a 1 or back again. Since 
we start with 0 and end with 1, we switch an odd number of times. Hence v has 
odd degree. Since the number of vertices of odd degree in every graph is even, 
the number of vertices other than v having odd degree is odd, so there are an 
odd number of completely labeled cells | 
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8.3.18. Application. Brouwer Fixed- Point Theorem. Brouwer’s Theorem (for 
two dimensions) can be interpreted as saying that a continuous mapping from a 
triangular region T to itself must have a fixed point. Suppose that the corners of 
T are the points (vectors) vo, v1, ve. Just as we can express a point on a segment 
uniquely as a weighted average of its endpoints, so we can express each v € T 
uniquely as a weighted average of the corners: v = agvp + a1v1 + agu2, where 
>> a; = 1 and each a; > 0 (Exercise 37). We can specify v by its vector of 
coefficients a = (do, aj, ag). 

Define sets So, 51, So from the mapping f by placing a ¢€ S; ifa; < a;, where 
f(a) =a’. Since the coefficients of each point sum to 1, every point in T belongs 
to some S;, and a point belongs to all three sets if and only if it is a fixed point 
for f. We want to show that the three sets have a common point. 

Given a simplicial subdivision of T, for each node a choose a label i such 
that a € S;. Points on the edge of T opposite v; have ith coordinate 0. Their ith 
coordinate cannot decrease under f, so we can choose a label other than i for 
each point on that edge. The resulting labeling is proper, and Sperner’s Lemma 
guarantees a completely labeled cell. Repeating the process using triangula- 
tions with successively smaller cells yields a sequence of successively smaller 
completely labeled triangles. Let the jth triangle have corners x;, y;, z; with 
labels 0,1,2, respectively. In each 5;, we obtain an infinite sequence of points. 

The remaining details are topological; we merely suggest the steps. Since 
f is continuous, each 5; is closed and bounded. Every infinite sequence of points 
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in a closed and bounded set has a convergent subsequence. Hence {x, x2, ...} 
has a convergent subsequence; let x;, be its kth entry. Because the distance 
from x;, to y;, and z;, approaches 0, these subsequences also converge to the 
same point. Since So, 51, Sp are closed and bounded, this limit point belongs to 
all three of them and is a fixed point of f. | 


We also apply Sperner’s Lemma to solve a problem on the “triangular grid”. 


8.3.19. Definition. When the vertices of G are numbered with distinct inte- 
gers, the dilation is the maximum difference between integers assigned 
to adjacent vertices. The bandwidth B(G) of a graph G is the minimum 
dilation of a numbering of G. 


Dilation is always minimized when there are no gaps in the numbering, 
but it can be convenient to allow gaps (Exercise 42). The name “bandwidth” 
comes from matrix theory; the optimal numbering describes a permutation of 
the rows and columns of the adjacency matrix so that the 1’s appear only in 
diagonal bands close to the main diagonal; arranging the matrix in this order 
can speed up computation of the inverse. Another motivation is to minimize the 
delay between adjacent vertices when the vertices must be processed in a linear 
order. Computation of bandwidth is NP-hard even for trees with maximum 
degree 3 (Garey—Graham—Johnson—Knuth [1978]). 

We present two fundamental lower bounds on bandwidth. 


8.3.20. Lemma. B(G) > maxycG we 


Proof: Every numbering of G contains a numbering of each subgraph of G. On 
every subgraph H, two numbers differing by at least n(H) — 1 are used. By 
the pigeonhole principle, some edge on a path between the two corresponding 
vertices has dilation at least n(H) — 1 divided by the distance between them. & 


8.3.21. Lemma. (Harper [1966]) B(G) > max; min{|dS|: |S| = k}, where 0S 
denotes the subset of vertices in a set S C V(G) that have at least one 
neighbor outside S. 


Proof: For every value of k, some set S of k vertices must be the first k vertices 
in the optimal numbering of G. The bandwidth of G must be at least |dS|, 
because the vertex among 05 that has the least label has an edge of dilation at 
least |dS| to its neighborhood above S. 1 


Chung [1988] named the first bound the local density bound. The com- 
putation of Harper’s bound is usually difficult. For the cube Q;, the value is 
ee, (i - ))- For the grid P,,0 P,, the value of Harper's lower bound is min{m, n}, 
which can be achieved (Exercise 43). 


8.3.22. Example. The triangular grid. The triangular grid 7; consists of ver- 
tices (i, j,k) such that i, j,k are nonnegative integers summing to /, with two 
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vertices adjacent if the total of the absolute differences in corresponding coor- 
dinates is 2. Below we show 7. Numbering the vertices by rows produces an 
upper bound of / + 1 for B(7,). This is optimal, but the local density bound is 
only about //2, and Harper’s bound is about //./2. Sperner’s Lemma can be 
used to prove that / + 1 is optimal. a 


Let G be the graph formed by a simplicial subdivision of a triangle. The. 
outer boundary of G is a cycle, the bounded regions are triangles, and the cycle 
is partitioned into three paths by the corners of the large triangle. We say that 
a connector is a vertex set inducingza connected subgraph that contains a 
vertex of each boundary path. 
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8.3.23. Lemma. (Hochberg-McDiarmid-Saks [1995]) Let T be a simplicial 
subdivision in which each vertex is assigned red or blue. Let R, B be the 
subgraphs induced by the red and by the blue vertices, respectively. For 
each such coloring, exactly one of R, B contains a connector. 


Proof: For each vertex v, consider the vertices reachable from v using vertices 
with the same color as v. If the three sides are not all reachable, label v with 
the smallest index of a side not reachable from v. For the vertices on the ith 
side, the label i does not appear. If there is no connector, then each node has a 
label, and this is a proper labeling of T. 

By Sperner’s Lemma, there is a completely labeled cell. Since the cell has 
three corners and we only used two colors R, B, two of the corners of this cell 
have the same color. Since they are adjacent, they can reach the same set of 
vertices in their color. Hence the least side unreachable from them cannot be 
different. This contradiction means that we could not have constructed the 
specified labeling. Hence there is a vertex from which every side is reachable. 

If one color has a connector, it partitions the remaining vertices into sets 
such that from each set at least one side is unreachable. Hence there cannot be 
connectors in both colors. | 


8.3.24. Theorem. (Hochberg-McDiarmid—Saks [1995]) Let G be a graph that 
triangulates a region bounded by a cycle C partitioned into three paths. If 
k is the minimum over v € V(G) of the sum of the distances from v to each 
of the three paths, then B(G) >k +1. 
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Proof: Let f be a numbering of G. Let t be the maximum index such that the 
subgraph induced by the vertices numbered 1, ..., t does not have a component 
meeting all three paths. Let R be this vertex set, let S be the set of vertices 
outside R having neighbors in R, and let T be the remaining vertices. 

By construction, the vertex v with f(v) = 1+ 1 belongs to S. Since R U {v} 
contains a connector, R U § contains a connector, and T does not. Since there 
is no edge between R and T and R contains no connector, R UT contains no 
connector. Now Lemma 8.3.23 implies that S contains a connector. The set S 
equals:a(S UT) for the terminal segment § UT in the numbering. Therefore, 
the numbering has difference at least |S| on some edge from S to R. 

A connector contains walks from each of its vertices to each of the three 
boundary paths. By hypothesis, the sum of the lengths of these walks from any 
fixed vertex is at least k. There exists a vertex in S for which these walks in S 
are disjoint paths. Hence |S| > k +1. a 
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8.3.25. Corollary. The triangular grid 7; has bandwidth / + 1. 


Proof: For each vertex (i, j,k) in 7), the distances to the three sides are /, j, k, 
respectively, so the sum of the distances is/. By Theorem 8.3.24, the bandwidth 
is at least / + 1, which we have observed is achievable. a 
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EXERCISES 


8.3.1. (—) Each of two concentric discs has 20 radial sections of equal size. For each 
disc, 10 sections are painted red and 10 blue, in.some arrangement. Prove that the two 
discs can be aligned so that at least 10 sections on the inner disc match colors with the 
corresponding sections on the outer disc. 


8.3.2. For n € N, let S be a set of n + 1 elements in {1,...,2n}. Prove that S has two 
elements with greatest common factor 1 and has two elements such that one divides the 
other. For each conclusion, exhibit a subset of size n where it does not hold; hence these 
conclusions are best possible. 


8.3.3. Use partial sums and the pigeonhole principle to prove the following statements. 
a) Every set of n integers contains a nonempty subset whose sum is divisible by n. 
(Also exhibit a collection of n — 1 integers with no such subset.) 
b) Given x € R, prove that at least one of {x, 2x,..., (n — 1)x} differs by at most 1/n 
from an integer. 


8.3.4. (!) A private club has 90 rooms and 100 members. Keys are given to the members 
so that any 90 members have access to the rooms in the sense that each of these 90 
members will have a key to a different room. (They do not share their keys.) Prove that 
at least 990 keys are needed and that 990 suffice. 
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8.3.5. Let T be a tree. Use the technique of Theorem 8.3.2 to prove that the center 
of T consists of one vertex or two adjacent vertices (this proves Theorem 2.1.13 again). 
(Jordan [1869], Graham—Entringer—Székely [1994]) 


8.3.6. Prove that every set of 2” + 1 integer lattice points in R” contains a pair of points 
whose centroid (mean vector) is also an integer lattice point. 


8.3.7. Prove that every 2-coloring of the integer lattice points in R” has a collection of n 
points with the same color whose centroid (mean vector) is an integer lattice point also 
having that color. (Hint: Ramsey’s Theorem is not needed; there is a short proof using 
only the pigeonhole principle.) (Bona [1990]) 


8.3.8. Let S be a collection of n + 1 positive integers summing to k. For k < 2n+1, prove 
that S has a subset with sum i for each i ¢€ [k]. For each n, exhibit a collection for which 
this fails when k = 2n + 2. 


8.3.9. For even n, construct an ordering of E(K,) so that the maximum length of an 
increasing trail is n — 1. (Comment: This proves that Theorem 8.3.4 is best possible 
when n is even. It also is best possible when n is odd and at least 9, but the construction 
is much more difficult.) (Graham—Kleitman [1973]) 


8.3.10. Let S be a set of nine points in the plane (no three collinear). Prove that S 
contains the vertex set of a convex 5-gon. Exhibit a set of eight points without this 
property. 
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8.3.11. (!) Let S be a set of R(m, m; 3) points in the plane no three of which are collinear. 
Prove that S contains m points that form a convex m-gon. (Tarsi) 


8.3.12. Recall that a digraph is simple if no two edges have the same ordered pair of 
endpoints. A monotone tournament is a tournament in which the orientation of the 
edges always agrees with the order of the indices on the vertices or always disagrees with 
that order. A complete loopless digraph has one copy of each ordered pair of distinct 
vertices as an edge. Given m, prove that if N is sufficiently large, then every simple 
loopless digraph with vertex set [NV] has an independent set of order m or a monotone 
tournament of order m or a complete loopless digraph of order m. 


8.3.13. (!) Schur’s Theorem. (Schur [1916]) 

a) Given k > 0, prove that there exists a least integer s, such that every k-coloring 
of the integers 1,..., 5, yields a monochromatic x, y, z (not necessarily distinct) such 
that x + y = z. (Hint: Apply Ramsey’s Theorem for r = 2.) 

b) Prove constructively that s, > 35,_1 — 1 and hence that s; > (3* — 1)/2. 


8.3.14. (!) The composition or lexicographic product of two simple graphs G and 
H is the simple graph G[H] whose vertex set is V(G) x V(H), with edges given by 
(u,v) = (u’, v’) if and only if (1) uv’ is an edge of G, or (2) u = uv’ and vv’ is an edge of H. 

a) Prove that a(G[H]) = a(G)a(H). _ 

b) Prove that the complement of G[H] is G[H]. 

c) Use parts (a) and (b) to prove by construction that 


R(pq +1, pqt+1)—12> [R(p+1,p+1)-1) x [R@+1,¢+1)- 11. 


d) Deduce that R(2” + 1,2” + 1) > 5" +1 for n > 0 and compare this lower bound 
to the nonconstructive lower bound for R(k, k). (Abbott [1972]) 


8.3.15. (—) Verify that R(p, 2) = R(2, p) = p. Use this and Theorem 8.3.11 to prove 
that R(p, q) < (?577) 


p-1 
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8.3.16. (—) Use the graph below to prove that R(3, 5) = 14. 


8.3.17. Ramsey numbers for r = 2 and multiple colors. 
a) Let p = (pi,..., Px), and let g; be obtained from p by subtracting 1 from p; but 
leaving the other coordinates unchanged. Prove that R(p) < Ea R(qi) —k +2. 


b) Prove that R(pi + 1,..., pr +1) < See 


8.3.18. Let r, = R,(3; 2) (this is the value of n such that k-coloring E(K,) forces a 
monochromatic triangle). 

a) Show that r;, < k(r-1 — 1) +2. ‘ 

b) Use part (a) to show that r;, < [kle] +1, so that rs < 17. (Comment: r3 = 17, 
but the lower bound requires a clever 3-coloring of Ki, that arises from the finite field 
GF (2"*)). _ 


8.3.19. Prove that R,(p;r +1) <r+k", where M= (Rein), 


/m 


8.3.20. (+) Off-diagonal Ramsey numbers.” 

a) Prove that R(k, 1) > nif (7) pt) + ("ya _ p)(2) <1 for some p € (0,1). Prove that 
R(k, 1) > n — (") p@ — (")(1 — p)@) for alln e Nand pe (0, J). 

b) Use part (a) to prove R(3,k) > k3/2+°, What lower bound on R(3, k) can be 
obtained from the first part of (a)? (Spencer [1977]) 

c) Use part (a) to obtain a lower bound for R,(q). 


8.3.21. (!) Determine the Ramsey number R(Ki,, Ki,.). (Hint: The answer depends on 
whether m and n are even or odd.) 


8.3.22. (!) Let T be a tree with m vertices. Given that m — 1 divides n — 1, determine 
the Ramsey number R(T, K,,,). (Burr [1974]) 


8.3.23. If p > (m—1)(n—1), prove that every 2-coloring of Z(K,) in which the red graph 
is transitively orientable contains a red m-clique or a blue n-clique, and prove that this 
is best possible. (Brozinsky—Nishiura) (Hint. Use perfect graphs.) 


8.3.24. Show that R(T, Kn,,..., Kn) = (m — 1)(R(m1, ...,%) — 1) + 1 when T is a tree 
with m vertices. (Burr) 


8.3.25. Prove that R(C4,, C4) = 6. (Comment: There are many proofs.) 


8.3.26. Prove that R(2K3,2K3) = 10. (Hint: Reduce to the case of a bowtie with trian- 
gles of both colors plus monochromatic 5-cycles; then use symmetry.) 
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8.3.28. (!) For 1 <i < k, let G; be a graph on p; vertices, and fix a multiplicity m;. Prove 
that R(m,G),...,mx¢Gx) < > (m; — 1)p; + R(Gi,..., Gx). 


8.3.29. Frank] and Wilsbn [1981] explicitly constructed graphs with n vertices that have 


no clique or independent set with size exceeding 2°V !8"!°8!°8", where c is a particular 
constant. Prove that this gives a lower bound for R(p, p) that grows faster than every 
polynomial in p but slower than every exponential in p. 


8.3.30. (!) For every simple graph G, determine R(P3, G) as a function only of the num- 
ber of vertices of G and the maximum size of a matching in G. 


8.3.31. (!) Let r and s be natural numbers with r +s #4 0 mod 4. Prove that every 
2-coloring of E(K,.,) has a monochromatic connected graph with at least [r/2] + [s/2] 
vertices. Conclude that every 3-coloring of E(K,.;) contains a monochromatic connected 
subgraph with more than (r + s)/2 vertices. Show that this fails when 4 dividesr +s. 


8.3.32. Forcing 4-cycles. 
a) Prove that if ) >, ey) (“5”) > ("S”), then G contains a 4-cycle. 


b) Prove that if e(G) > (1 + ./4n(G) — 3), then G contains a 4-cycle. 
c) Prove that R,(C4) < k? +k +2. (Chung—Graham [1975]) 


8.3.33. (!) Bondy [1971a] proved that x # y implies d(x) + d(y) = n(G), then G = 
K,, or G has a cycle of each length from 3 to n. Use this to prove that R(Cm, Kin) = 
max{m, 2n + 1}, except possibly if m is even: and at most 2n. (Lawrence [1973]) 


8.3.34. (!) Prove that every 2-coloring of E( K, ) has a Hamiltonian cycle that is monochro- 
matic or consists of two monochromatic paths. (Hint: Use induction on n.) (Lovasz [1979, 
p85, p482 - attributed to H. Raynaud]) 


8.3.35. (+) Let f be a 2-coloring of E(K,), and suppose that k > 3. Prove the following: 

a) If f has a monochromatic C41, then f also has a monochromatic Co. 

b) If f has a monochromatic C;, then’ =f also has a monochromatic Cy,_; or 2K,. 

c) If m > 5, then R(C,,, Cn) < 2m —.1 (see Exercise 8.3.25 for m = 4). (Hint: Use 
parts (a) and (b) and the result of Erdés—Gallai [1959] (Theorem 8.4.35) that e(G) > 
(m — 1)(n(G) — 1)/2 forces a cycle of length at least m in G. There remains one difficult 
case). 
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8.3.36. The Ramsey multiplicity of G is the minimum number of monochromatic 
copies of G in a 2-coloring of the edges of a clique on R(G, G) vertices. Show that the 
Ramsey multiplicity of K3 is 2. 


8.3.37. Prove that each point in-a triangular region has a unique expression as a convex 
combination of the vertices of the triangle (convex combinations are linear combinations 
where the coefficients are nonnegative and sum to 1). 


8.3.38. Sperner’s Lemma in higher dimensions. A k-dimensional simplex consists of 
the convex combinations of k + 1 points in R* not lying in a hyperplane. A simplicial 
subdivision expresses a k-dimensional simplex as a union of k-dimensional simplices 
(cells) such that any two cells intersect in the simplex determined by their common cor- 
ners. A completely labeled cell has {0, ..., k} at its corners. 

State a general definition of “proper labeling” so that every proper labeling of a sim- 
plicial subdivision of a k-simplex contains a completely labeled cell. Prove this theorem. 
(Hint: The proof of Sperner’s Lemma in two dimetisions (Theorem 8.3.17) is an instance 
of the induction step for a proof by induction on k.) 


8.3.39. (—) Compute the bandwidths of P,, K,, and C,. 
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8.3.40. Compute the bandwidth of K,,,_..,,. (Eitner [1979]) 


8.3.41. (!) Prove that every tree with k leaves is the union of [k/2] pairwise intersecting 
paths (Exercise 2.1.37). Use this to prove that the bandwidth of a tree with k leaves is 
at most [k/2]. (Ando—Kaneko—Gervacio [1996]) 


8.3.42. (+) Let G be a caterpillar (Definition 2.2.17), and let m be an integer such that 
[ 3 | < m for all H C G. Prove that B(G) < m. (Hint: Prove that G has a numbering 
f in which f(v) is a multiple of m whenever v is on the spine and | f(u) — f(v)| < _m for 


allu < v.) (Systo-Zak [1982], Miller [1981]) 


8.3.43. Bandwidth of grids. 

a) Compute the local density lower bound for the bandwidth of P,, 0 P,. 

b) Let S be a k-set of vertices in P,P, with a; vertices in the ith row and b; vertices 
in the jth column. Prove that |dT| < |d5| if 7 is the set consisting of the first a; vertices 
in the ‘th row for each i. 

c) Prove that |d5| is minimized over k-sets in V(P, 0 P,) by some S such that a; > 
++» > a, and b; > -.- > b,. Conclude that Harper’s lower bound for B(P, 0 P,) isn. 

d) Conclude that B(P,, 0 P,) = min{m, n}. (Chvatalova [1975]) 


8.3.44. (+) Let G be a simple graph with order n and bandwidth b. 

a) For e € G, prove that B(G +e) < 2b. 

b) Prove that ifn > 6b, then B(G + e) can be as large as 2b. 
(Comment: The maximum of B(G +e) isb+1ifn < 3b+4 and is [(n — 1)/3] if38b+5 < 
n < 6b — 2.) (Wang—West—Yao [19295]) 
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8.4. More Extremal Problems 
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Extremal graph theory is a huge area. In Section 1.3 we described the 
distinction between optimization problems (find an extremal structure in the 
input graph) and extremal problems (find an extremal instance over a class 
of graphs), and we have studied both types of problems throughout this book. 
In this section we study the latter type. The archetypal example is the Turan 
problem: find the maximum number of edges in a graph not containing H asa 
subgraph. We list one additional example from each chapter. 


Objective Class of graphs Answer Reference _ 
maxe(G) _ n vertices and k components Ce) Exercise 1.3.40 
max girth diameter k and not a tree 2k+1 Exercise 2.1.61 
max B(G) a’(G)<k 2k Exercise 3.3.10 
mina(G) «(G) =k and diameter d [(d+1)/2] Exercise 4.2.22 
max x(G) 2Ko-free and w(G) =k ($’) Exercise 5.2.11 
max x(G) outerplanar 3 Exercise 6.3.12 
maxe(G) n(G)=nandnon-Hamiltonian _ ("3’) +1 Exercise 7.2.26 
maxn(G) w(G) < panda(G) <q R(p,q)—1 Section 8.3 


With such enormous variety of extremal problems, we can only hope in this 
section to exhibit a small sample of interesting results. 
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We first consider parameters related to three types of graph encoding. 
Each model of encoding involves assigning vectors to vertices, and the param- 
eter is the minimum length of vectors that suffice. We study the maximum of 
this parameter over n-vertex graphs. The parameters are intersection number, 
product dimension, and squashed-cube dimension. 


8.4.1. Definition. An intersection representation of length ¢ assigns each 
vertex a 0,1-vector of length t such that u < v if and only if their vectors 
have a 1 in acommon position. Equivalently, it assigns each x € V(G) a set 
S, © [t] such that u < v if and only if §,9S, 4 ©. The intersection num- 
ber 0’(G) is the minimum length of an intersection representation of G. 


The elements of [rt] in a representation correspond to complete subgraphs 
that cover E(G). This motivates our use of 9’ for intersection number: 0(G) is 
the minimum number of cliques needed to cover V(G). 


8.4.2. Proposition. (Erdés—Goodman—Pésa [1966]) The intersection number 
equals the minimum number of complete subgraphs needed to cover E(G). 


Proof: We define a natural correspondence between representations of length t 
and coverings of E(G) by t complete subgraphs. Each i € [t] generates a clique 
{v € V(G): i € S,}. The resulting compjete subgraphs cover E(G), since u = v 
if and only if 5,9 S, 4 @. 

Conversely, if complete subgraphs.Q;,..., Q; cover E(G), then assigning 
{i: v € V(Q;)} to each vertex v yields an intersection representation. | 


Pam. 


Hence 6/(G) = e(G) if G is triangle-free, and 6’(K{n/2),(n/2)) = |n?/4|. In 
fact, this is the unique n-vertex graph maximizing 0’(G). Exercise 1 suggests a 
direct proof of the bound; here we present a stronger result. 

Let F be a family of graphs. For an input graph G, the F-decomposition 
problem is to decompose G into the minimum number of graphs in F. When 
F is not closed under taking subgraphs, F-decomposition may require more 
subgraphs than F-covering. For example, we can cover the kite with two 
complete subgraphs, but three complete subgraphs are needed to decompose it. 

Proving 6’(G) < |n?/4]| for n-vertex graphs means showing that every n- 
vertex graph can be covered with |n?/4| complete subgraphs; we prove the 
stronger result that there is always a decomposition using at most this many 
complete subgraphs. In fact, we can find such a decomposition greedily. 


8.4.3. Theorem. (McGuinness [1994]) Every greedy clique decomposition of an 
n-vertex graph uses at most |n?/4| cliques. 


Proof: We use induction on n. The claim is obvious for n < 2; consider n > 2. 
Let Q = Q1,..., Om be a greedy decomposition of G, meaning that each Q; isa 
maximal complete subgraph in G — U;.;E(Q;). Note that deleting Q; from the 
list Q leaves a greedy decomposition of G — E(Q)). 
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If each Q; has at least three edges, then m < n?/6, so we may assume that 
some Q; is an edge xy. Let R consist of the elements of Q—{Q;} that are incident 
to x, and let S consist of those incident to y. The set Q’ = Q— (RUSU{Q;}) isa 
greedy decomposition of a subgraph of G — x — y. By the induction hypothesis, 
|Q’| < (n — 2)?/4. Hence it suffices to prove that |R| + |S| <n — 2. 

We prove this by choosing distinct vertices in V(G) — {x, y} from the vertex 
sets of the elements of R U S. Since each edge is deleted exactly once, each v ¢ 
{x, y} appears once in R if v € N(x) and once in S if v € N(y). Consider Q € R. 
If Q uses a vertex v ¢ N(y), then we choose such a v for Q. If V(Q) € Ny), 
then we choose for Q a vertex v € Q such that vy belongs.to the earliest element 
of Q that contains both y and some vertex of Q. Call this element Q’; note that 
Q’ is the only element of S containing v. Since Q and xy are maximal when 
chosen, Q’ precedes both of these in Q. For elements of S, choose vertices by 
reversing the roles of x and y. 

We have shown that if v belongs to some Q € R and to some Q’ € S, and 
v is chosen for one of them, then the one for which it is chosen occurs after 
the other one in the list Q. Hence no vertex is chosen twice. We conclude that 
[Rl +|S| <n —2and m < n?/4. . a 


Sot 


o 
Pal 
a y 
o 


OTc 


Both Chung [1981] and Gyéri—Kostochka [1979] strengthened the decom- 
position bound, proving that every n-vertex graph has a decomposition into 
complete subgraphs whose orders sum to at most |n(G)?/2|. 

Now we consider the second encoding model. 


8.4.4. Definition. A product representation of length ¢ assigns the vertices 
distinct vectors of length ¢ so that u < v if and only if their vectors differ in 
every position. The product dimension pdim G is the minimum length 
of such a representation of G. 


By devoting one coordinate to each e € E(G), in which the vertices of e have 
value 0 and other vertices have distinct positive values, we obtain pdim G < 
e(G) (if G is not a complete graph). 


8.4.5. Example. Every complete graph has product dimension 1. For K,, each 
pair of vertices must agree in some coordinate, but we cannot assign two ver- 
tices the same vector. Hence two coordinates are needed, and assigning (0, /) 
to v; for each j suffices. 

For K,+ K,-1, the vectors for the clique must differ in each coordinate. The 
vector for the isolated vertex must agree with each of the others somewhere, 
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but it cannot agree with more than one in a single coordinate. Hence at least 
n — 1 coordinates are needed. This suffices, by using (1,2,...,n — 1) for the 
isolated vertex and (i,i,...,i) for the ith vertex of the clique. a 


Again we can describe the parameter using complete graphs. 


8.4.6. Definition. An equivalence on G is a spanning subgraph of G whose 
components are complete graphs. 


8.4.7. Proposition. The product dimension of G is the minimum number of 
equivalences Ej,..., E, such that J E; = G and (| E; =@ 

Proof: Again there is a natural bijection. Given a product representation, the 

ith coordinate generates E;, with a component for each value used in the ith 

coordinate. Every nonadjacent pair agrees in some coordinate, so every edge of 

G is covered. 


Conversely, given £;,..., E;, each component of E; becomes a fixed value 
in the ith coordinate of a representation. The requirement NE; = @ is the 
requirement of using distinct vectors in the product representation. a 


8.4.8. Lemma. If x’/(G) > 1, then pylim G < x'(G), with equality if G is 
triangle-free. 


book 


Proof: Every matching is a disjoint union of complete graphs and becomes an 
equivalence by the addition of isolated vertices; hence x/(G) equivalences cover 
G. If x’( '’(G) > 1, then these equivalences have no common edge. 
If G is triangle-free, then every equivalence used in a cover of G is a match- 
ing plus isolated edges, and thus x/(G):< pdim G. a 
8.4.9. Corollary. For n > 3, the maximum product dimension of an n-vertex 
graph is n — 1. 


Proof: Let G be an n-vertex graph. By Lemma 8.4.8 and Vizing’s Theorem 
(Theorem 7.1.10), pdim G < x'/(G) < A(G) +1 <n. Furthermore, the bound 
improves to n — 1 unless A(G) = n—1. Let S be the set of vertices of degree 
n —1in G; we may assume that |S| =k > 1. 

By Lemma 8.4.8 and Vizing’s Theorem again, pdim (G — $) <n—k. By 
duplicating coordinates if needed, we obtain a product representation of G — § 
of length n — k. Let x' be the vector assigned to v; in this representation. 

Each vertex of S is isolated in G. We now assign to each v € S the vector 
whose ith coordinate, for 1 < i < n —k, is the ith coordinate of x’. If k = 1, 
then this completes a representation of G with length n — 1. If k > 1, then we 
have assigned the same vector to all of 5; we add one coordinate using distinct 
values to complete a representation of length n — k + 1, which is less than n — 1. 

Since pdim (K, + K,_1) =n — 1 (Example 8.4.5), the bound is sharp. | 


Lovasz—NeSsetril—Pultr [1980] characterized the n-vertex graphs with prod- 
uct dimension n — 1 (Exercise 4). They also proved a general lower bound using 
a dimension argument in linear algebra. 
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8.4.10. Theorem. (Lovasz—NeSetril—Pultr [1980]) Let u1,...,u, and vj,..., v, 
be two lists of vertices (not necessarily distinct) in a graph G. Ifu; — v; 
fori = j andu; # v; fori < j, then pdim G > [lgr]. 


Proof: Let G have a representation of length d. Let x!,...,x” and y!,..., y’ 
be the vectors for u1,...,u,; and U1,...,U,, respectively. The vectors x' and y' 
differ i in pa cootiinste, but x! and yf agree in some coordinate if i # 7. Hence 
[ti — y)) is nonzero if and only if i = j. 

We use this product property to construct r linearly independent vectors 
in R”’; this proves that r < 27 and hence that pdim G > [lgr]. Expansion of 
Tt (wy — 2%) for w, z € R? yields the sum Discta | lies Wi lhe zj). To relate 


r to 2¢, we view this as a dot product in R®’, with coordinates indexed by the 
subsets of [d]. For each w € R’, define two vectors i in R™ by setting Ws = []j<5 wi 
and ws = [|,,;(—w:) for the eootdinate S C [d]. With this definition, the dot 
product w wz eeabte T]p—1 (wx — 2%). The conditions on the x’s and y’s thus imply 
that x' - ¥/ is nonzero if and only it =j, 

We claim that ¥',...,%” are independent. Consider a linear dependence 
>)-1 cix' = 0. Taking the dot product of j” with both sides kills all terms below 
i =r, yielding c,x’ - y’ = 0. Since x’ - ¥’ 4 0, we have c, = 0. We can now apply 
the same argument using §’—!. Knowing that c, = 0 yields c,1x" 1. p71 = 0. 
Successively decreasing the index yields cj = 0 for all j. We conclude that 
x1,...,x” are independent, which requires 24 > Yr, |] 


ye/me 


8.4.11. Example. Matchings: pdim (n/2) Kz = [lgn]. Given k coordinates, the 
graph encoded by using all 2* binary k- tuples as codes is 2‘! Ky, since only with 
its complement does a vector disagree in each position. If n is not a power of 2, 

then we can discard complementary pairs to obtain a construction. The lower 
bound follows from Theorem 8.4.10, using each vertex in each list (for example, 
set uji= Un41—i) a 


In our third encoding model, we want to recover more detailed informa- 
tion: distance between vertices. This arises from an addressing problem in 
communication networks. Each message should travel a shortest path to its 
destination. Without central control, a vertex receiving a message must deter- 
mine where to send it using only the name of the destination. If the vectors for 
two vertices yield the distance between them in G, then a vertex can compare 
the destination vector with the vectors for its neighbors and send the message 
to a neighbor closest to the destination. 

For a connected graph G, we want to assign vectors to vertices such that 
the distance between vertices is the number of positions where their vectors 
differ. This is an isometric or “distance-preserving” embedding of G into 
H=K,,0---OK,,, meaning a mapping f: V(G) > V(#) such that dg(u, v) = 
dy(f(u), f(v)). However, many connected graphs have no isometric embedding 
in a cartesian product of cliques; Co,.1 for k > 2 is an example (Exercise 11). 

Hence we introduce a “don’t care” symbol +. Let S = {0, 1, *}, and define a 
symmetric function d by d(0, 1) = 1 and d(0, *) = 0 = d(1, *). Let S” denote 
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the set of N-tuples (vectors) with entries in S, and for a,b € S% let ds(a, b) = 
> d(a;, b;). For each graph G, we obtain for some N an encoding f: V(G) > S* 
so that dg(u, v) = ds(f(u), f(v)) for all u, v € V(G). 

Each a € S% corresponds to a subcube of Qn, the N-dimensional cube; 
the dimension of the subcube is the number of *s in a. For a,b € S%, the 
minimum distance between vertices of the corresponding subcubes is d;s(a, b). 
The vectors assigned to distinct vertices correspond to disjoint subcubes, else 
their distance would be 0. If we contract the edges of each assigned subcube, 
we obtain a “squashed cube” H. The distance-preserving map f: V(G) > S*% 
is an isometric embedding of G in H. 


8.4.12. Definition. A squashed-cube embedding of length N is a map 
f: V(G) > S% such that dg(u, v) = ds(f(u), f(v)). The squashed-cube 
dimension qdim G is the minimum length of such an embedding of G. 


8.4.13. Example. The vectors 000, 001, 01x, and 1** form a squashed-cube 
embedding of K, with length 3. Two adjacent vertices of the 3-cube remain 
unchanged, an edge adjacent to both collapses, and the entire opposite face 
collapses. The resulting graph is K,. The image subcubes appear below in bold. 
The construction generalizes to embed K,, in a squashed n — 1-dimensional cube. 

The path P, embeds isometrically in Q,-1 without squashings, using 
00---00, 10---00, 11---00,...,11---10, 11---11. No shorter embedding ex- 
ists, because the distance between the endpoints of P, is n — 1, and each coor- 
dinate contributes at most 1 to the distance between vectors. a 
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8.4.14. Proposition. For a graph G, qdim (G) < )°;_ j 4G (Ui, vj). 


Proof: For each pair i, j withi < j, we dedicate a block of dg (v;, v;) coordinates. 
Set these coordinates to 0 for v;, to 1 for v;, and to * for other vertices. Given 
two vertices, the only coordinates where neither contains * are the coordinates 
dedicated to the pair, so dg(v;, vj) = ds(f (vi), f (v;)). a 


Using an eigenvalue technique (Exercise 8.6.14), Graham and Pollak [1971, 
1973] proved a general lower bound on qdim (G) that yields qdim K, =n — 1. 
Hence K,, and P, both have squashed-cube dimension n — 1; Graham and Pollak 
conjectured that qdim G < n — 1 for every n-vertex connected graph. Graham 
offered $100 for a proof, and Winkler found an encoding scheme to prove this 
“Squashed-Cube Conjecture”. 
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Winkler’s proof generates an explicit n — 1-dimensional squashed-cube en- 
coding for each connected n-vertex graph G. We begin by indexing the vertices; 
choose vp arbitrarily. Next, find a spanning tree T such that dr (v, v9) = dg(v, vo) 
for all v € V(G) (T can be generated by a breadth-first search from ug). Now, 
number the vertices by a depth-first search in T. In other words, having chosen 
the indexing for vo, ..., v;, let vj,1 be an unvisited child of v; in T, if one exists; 
otherwise backtrack toward the root until‘a vertex with such a child is found. 
The resulting indices increase along every path from vp in T. 


8.4.15. Example. Depth-first numbering of a breadth-first spanning tree. Be- 
low, the bold edges belong to T and the solid edges to G — T. We will use this 


example to illustrate several steps in the proof. a 
| 2 
0 
3 4 5 


We henceforth fix T and this ordering and refer to vertices by their index 
in this ordering. Let P; be the vertex set of the i, 0-path in T, let i’ be the parent 
of i in T (the next vertex on the path from i to 0), and let i A j = max(P; N Pj) 
be the vertex at which the i, 0-path and j,0-path meet. Given a depth-first 
numbering of a breadth-first tree T in-G, let c(i, j) = dr(i, j) — dgti, j) be the 
discrepancy of two vertices i, j. Bs 


teleg 


8.4.16. Example. In the graph marked c(i, k) below, we record at each vertex 


k the discrepancy c(i, k) for the tree T in Example 8.4.15. | 
0 0 0 Oo 0 O 
0 0 0 1 1 1 1 3 8 
1 1 1 8 1 8 
0 0 0 0 0 0 0 0 0O 


8.4.17. Lemma. (Winkler [1983]). Discrepancy has the following properties. 
a) c(i, j) =c(j,i) > 0. 
b) Ifi €.P;, then c(i, 7) = 0. 
c) If neither i ¢€ P; nor j € P;, then c(i, j’) < c(i, j) < c@, j’) +2. 
Proof: (a) Distance in graphs is symmetric, and the shortest i, j-path in G is no 
longer than the path between them in 7. (b) The preservation of distances to 
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vo implies that the i, j-path in T is a shortest i, j-path in G. (c) Since j’ belongs 
to the i, j-path in T, we have dr(i, j) — dr(i, j’) = 1. Since jj’ € E(G), we have 
ld, 7) — de (i, j)|_< 1. Thus c(i, j) — c(i, j’) is 0, 1, or 2. a 

i 
J 
iAj 
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With this notion of discrepancy, we can give an overview of how Winkler’s 
encoding works. We use a search tree because it gives us n — 1 natural coordi- 
nates. Distance in the tree is an “approximation” to distance in the graph; it 
needs to be adjusted (reduced) by the discrepancy. Winkler’s encoding puts a 1 
in coordinate k for exactly one of vertices i and j for exactly dr(i, j) values of 
k. We want the other code to have a 0 in exactly dg(i, j) of these coordinates, 
so we perform the adjustment by having « in exactly c(i, j) of the coordinates 
where one code has a 1. The problem is to design the encoding to achieve this 
simultaneously for all pairs of vertices. 


4 


8.4.18. Theorem. (Winkler [1983]) Every connected n-vertex graph G has 
squashed-cube dimension at most n — 1. 


Proof: Choose a tree T and numbering 0,...,2 — 1 as described above. We 
define an encoding f (i) = (fi(i),..., faai(i)) and verify that dg (i, j) = ds(a;, a;). 
The encoding is 3 


1 ifke P; 

* ifc(i,k) —c(i,k’) =2 
fii) = 4 * ifcG,k) —c(i,k’) =1landi <k and c(i,k) is even 
* ifc(i,k) —c(i,k’) =landi >k and c(i, k) is odd 


0 otherwise 


(The vectors in the encoding for Example 8.4.16 are f(0) = 00000, f(1) = 
10000, f(2) = 1100, f(3) = +0100, f(4) = **110, and f(5) = **111.) 

To prove that ds(f (i), f(j)) = dg(i, j), we count the coordinates where one 
of f (i), f(j) has a 1 and the other has a 0. Such coordinates k belong to P; U P;, 
where all the 1’s are located. By symmetry, we may assume that i < j. Hence 
j ¢ P;, and we consider the two cases i € P; andi ¢ Pj. 

Ifi € P;, then dg(i, j) = dr(i, j) = |P; — P|, and f.(i) = fc(j) = 1 if and 
only if k € P;. The coordinates where exactly one of f(i), f(j) has a 1 all lie in 
P; — P;. Fork € P; — P;, we have f, (i) = 0, and thus dg(i, j) = ds(f (i), f(/)). 

Ifi ¢ P;, then exactly one of { f;(k), f;(k)} equals 1 precisely when k € (P; — 
P;) U (P; — P;). We need to prove that the other vector has « in exactly c(i, j) of 
these coordinates. This will yield 


ds(f@), FG) = |Fj — Pil + |P: — Bil - eG) = ari, f) — Gi, 7) = de, J). 
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In Example 8.4.16, (Ps — P2:) U (P2 — Ps) is all five coordinates; since f (2) and 
f() together have * in three of these coordinates, we have ds(f (2), f(5)) = 
dg (2, 5) = 2, as desired. 

To locate the x’s in these positions, consider the change in discrepancies as 
we bring either of i, j to the point where P,;, P; meet. Consider two lists: 


O=cli,iAj) <:--<ci,j) < ct, J) 
O=cUiAj,j) <---<ci', Jj) < cG, j). 


We will obtain one * in f(i) for each even m with 0 < m < c(i, j) and one x, in 
f(j) for each odd m with 0 < m < c(i, j). . 

For even m with 0 < m < c(i, j), let jm be the unique vertex such that 
C(i, jm) = m and c(i, j,,) < m. Even when the value m is not in the first list, j,, 
is well-defined. Because c changes by at most 2 with each step, the values of 
jm are distinct. Furthermore, the depth-first ordering guarantees i < k for all 
k € P; — P;. Thus f,(i) = * fork € P; — P; if and only ifk = j,, for some even m. 
In Example 8.4.16; for (i, 7) = (2, 5) we have jo = 4 and f4(2) = *. 

Similarly, for odd m with 0 < m < c(i, j), let im be the unique vertex such 
that c(in, j) = m and c(i,,, j) < m. As before, the values of i,, are distinct and 
well-defined. The depth-first ordering guarantees j > k for all k € P; — Pj, so 
aj(k) = * fork € P; — P; ifand only ifk = i,, for some odd m. In Example 8.4.16, 
for (i, j) = (2, 5) we have i, = 1, ig = 2,and fi(j) = fa(j/) = *. 

Thus, we have counted the +’s in-P; — P; U P; — P;. Their number is the 
number of even integers between 1 and c(i, j) plus the number of odd integers 
between 1 and c(i, j), which together equals c(i, j). a 
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BRANCHINGS AND GOSSIP 
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We have studied the problem of finding the maximum number of pairwise 
edge-disjoint spanning trees in a graph; this equals the maximum k such that 
for every vertex partition P, there are at least k(|P| — 1) edges crossing be- 
tween sets of P (Corollary 8.2.59). Here we consider an analogous problem for 
digraphs that is related to Menger’s Theorem (Exercise 14). Menger’s Theorem 
is a min-max theorem that focuses on vertex pairs. We examine “connected- 
ness” from a single vertex to the rest of the digraph. 


8.4.19. Definition. An r-branching in a digraph is a rooted tree “branching 
out” from r. Vertex r has indegree 0, all other vertices have indegree 1, and 
all other vertices are reachable from r. Let «’(r; G) denote the minimum 
number of edges whose deletion makes some vertex unreachable from r. 


Deleting the edges entering a set X C V(G) — {r} makes each vertex of X 
unreachable from r. On the other hand, a minimal set whose deletion makes 
some vertex unreachable includes all edges leaving the set of reached vertices. 
Hence «’(r; G) equals the minimum, over nonempty X € V(G) — {r}, of the 
number of edges entering X. 
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In a set of pairwise edge-disjoint r-branchings, each must use at least 
one edge entering X. Thus there are at most «’(r; G) pairwise edge-disjoint 
r-branchings in G. Edmonds proved that this bound is achievable. Our discus- 
sion allows multiple edges. 


8.4.20. Theorem. (Edmonds’ Branching Theorem [1973]) For a vertex r in a 
digraph G, the maximum number of pairwise edge-disjoint r-branchings 

in Gisxk’(r; G). 

Proof: (Lovasz [1976]) Let V be the vertex set of G. The upper bound holds 
since each subset of V — r is entered by at least one edge in every r-branching. 
We prove the existence of «’(r; G) edge-disjoint r-branchings by induction on 
k=«'(r; G). For k = 1, a breadth-first search suffices to grow an r-branching, 
since every vertex is reachable. For k > 1, we seek an r-branching T such that 
K'(r; G— E(T)) =k —1; the induction hypothesis then supplies k — 1 additional 
r-branchings. 

A partial r-branching is an r-branching of an induced subgraph of G. Let 
T be a partial r-branching of maximum order such that «’(r; G— E(T)) >k—-1. 
The vertex r itself is such a branching, with E(T) = @. Let S = V(T). IfS=V 
then we are done, so we may assume that S 4 V. 

For X C V —,r, let ex denote the number of edges in G — E(T) that enter 
X. Ife, >k for every X C V —r that intersects V — S, then we can extend T by 
adding any edge from S to V —S. Hence we can choose a smallest set U C V—r 
that intersects V — S and is entered by exactly k — 1 edges. (In the illustration, 
T consists of the solid edges.) 

Because x«'(r; G) = k and we havetleleted no edge entering U — S, we still 
have ey_s > k. However, ey = k — 1, so ) there must be an edge xy from SMU to 
U—S. Weclaim that xy can be added to enlarge 7, contradicting the maximality 
of T. We need only verify that at least k — 1 edges still enter each W C V —r 
when we delete xy from G — E(T). This holds trivially unless x € V — W and 
y € W. It suffices to prove that ew > k for such a W. 

The quantity ew + ey counts edges entering W and entering U. Except for 
the edges between U — W and W — U, these enter W U U, and those entering 
WU are counted twice. Thus ew + ey > ewuu +ewny. We have ewuy >k-1 
by the defining property of T, ey = k — 1 by construction, and ewny > k by 
x € U — W and the minimality of U. Hence ew > k -1—-(k-1) +k =k, as 
desired. a 


— 
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Lovasz’s proof can be converted to an algorithm for finding the maximum 
number of pairwise disjoint r-branchings; Tarjan [1974/75] gave another algo- 
rithm. We might call «’(r; G) the local-global edge-connectivity. Theorem 
8.4.20 has several equivalent forms: 


8.4.21. Corollary. If G is a directed graph, r is a vertex of G, and k > 0, then 
the following statements are equivalent. 
A) G has k pairwise edge-disjoint r-branchings. 
B) k'(r; G) >.k; equivalently, |[X, X]| > k for all X © V(G) — {r}. 
C) For each s + r there exist k pairwise edge-disjoint r, s-paths. 
D) There exist k pairwise edge-disjoint spanning trees of the underlying 
(undirected) graph that for each s 4 r contain among them exactly k edges 
of the digraph G entering s. 


Proof: AB is Edmonds’ Theorem, B }+C is Menger’s Theorem, and A> D 
is immediate. For D=> B, assume that the trees exist and consider U C V — r. 
Each spanning tree has at most |U| — 1 edges within U, so the trees together 
have at most k(|U| — 1) edges within U. By hypothesis, the edges of the digraph 
G corresponding to these trees contain exactly k |U| edges with heads in U, so 
at least k edges enter U. | 


Schrijver observed that Edmonds’ Branching Theorem can also be proved 
using matroid union and matroid intersection. Discard the edges entering the 
root r. Let M, be the union of k copies of the cycle matroid on the underlying 
undirected graph. Let M2 be the matroid in which a set of edges is independent 
if and only if no k + 1 of them have the same head (this is the direct sum of 
uniform matroids of rank k). There exist k disjoint r-branchings if and only if 
these two matroids have a common independent set of size k(n(G) — 1). 


Pairwise edge-disjoint r-branchings provide a fault-tolerant static protocol 
for message transmissions from r; alternative trees are available. We next 
consider a static protocol for transmissions from each vertex to every other. 
Each transmission is two-way, but they are performed in a specified order. 

The resulting question is the gossip problem. Consider n gossips, each 
having a tidbit of information. Being gossips, each wants to know all the infor- 
mation, and when two communicate they tell each other everything they know. 
How many telephone calls are needed to transmit all the information? Several 
solutions were published in the early 1970s. 

Succeeding with 2n — 3 calls is easy: everyone calls x, and then x calls 
everyone back, saving one call by combining the last call in and first call out. 
When n > 4, 2n — 4 calls suffice: first the others call in to a set S of four people, 
then S shares the information in two successive pairings, and finally the others 
receive calls back from S, using a total of (n — 4) +4+(n —4) = 2n — 4 calls. 
Using a graph model, we show that this is optimal. 


8.4.22. Definition. An ordered graph is a graph with an ordering of the 
edges (multiple edges allowed). An increasing path is a path via suc- 
cessively later edges. A gossip scheme is an ordered graph having an 
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increasing path from each vertex to every other vertex. A gossip scheme 
satisfies NOHO (“No One Hears his or her Own information”) if it has no 
increasing x, y-path plus a later edge between x and y. 


8.4.23. Theorem. For n > 4, the minimum number of edges in a yossip scheme 
on n vertices is 2n — 4. 


Proof: (Baker—Shostak [1972]). We freely use “calls” in place of “edges” to 
emphasize the ordering and the possibility of repeated edges. The scheme de- 
scribed above uses 2n — 4 calls, and case analysis shows that it is optimal for 
n = 4, This provides the basis for a proof by induction on n. For n > 4, we may 
assume that every gossip scheme with n — 1 vertices uses at least 2n —6 calls. If 
2n — 4 is not optimal for n vertices, then we can add calls to the optimal scheme 
(if necessary) to obtain an n-vertex gossip scheme G with exactly 2n — 5 calls. 

Claim 1. G satisfies NOHO. Otherwise, G has an increasing path from x 
to v, along edges e1,..., e, followed by a call e,4,; = yx. Delete e; and e41. 
Partition the other calls involving x into k + 2 sets: Eo consists of those before 
e1, E; for 1 <i < k consists of those between e; and e;,1, and E,41 consists of 
those after e,,;. In each edge e € E;, replace x by v1, v;, or 4; in the cases i = 0, 
1 <i <k, ori =k +1, respectively (see illustration). Now E(G) — {e;, ex+1} 
is a gossip scheme on V(G) — {x}, because every increasing path through x 
is replaced by an increasing path that consists of the same edges and perhaps 
additional edges from {e;}. The scheme has 2(n — 1) — 5 edges, which contradicts 
the induction hypothesis. : 


telegram.me/ 


Ey 
v1 
EL, E Eo 
1 £o e2 V1 \e2 
E2 vg => v2 E>» 
E3 E, €3 Ey v3 4&3 
U3 
E3 


Claim 2. d(x) —3 calls are useless to x, and hence 5(G) > 3. Let O(x) be the 
set of calls on which some vertex is reached for the first time by an increasing 
path “Out” from x; these calls form a tree. The tree /(x) of edges useful “In” to 
x is O(x) for the reverse order on E(G). We show that O(x) N /(x) is the set of 
edges incident to x. If an increasing x, y-path reaches y € N(x) before the edge 
xy, then x violates NOHO; hence xy € O(x). Similarly, xy € /(x). Conversely, if 
O(x)M1(x) contains an edge e not incident to x, then an increasing path from x 
containing e and an increasing path to x containing e combine to violate NOHO 
for x. Hence |O(x) N/(x)| = d(x). The edges “useless to x” are those not in 
O(x) U I(x). We have 


|O@ UTE eee hi )4ednaaey= 3. 


Since this counts a set of edges, 5(G) > 3. 
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Claim 3. The subgraph obtained by deleting the first call and the last call 
made by each vertex has at least five components and has no isolated vertex. Let 
xy be the first call involving x. If the first call involving y is yz with z 4 x, then 
by definition it occurs before xy, and these two calls do not communicate from 
x to z. After yz and xy, an increasing x, z-path violates NOHO at z. Hence the 
set F of first calls is a matching, and there are n/2 of them. Similarly, the set L 
of last calls is a matching of size n/2. The graph G — F — L hasn — 5 edges and 
hence at least five components, by Proposition 1.2.11. It has no isolated vertex, 
since 5(G) > 3. 

The contradiction. Since e(G) = 2n — 5 < 2n, some vertex x has degree 
at most 3. Let C,, Co, C3 be the components of G — F — L containing x, its 
first neighbor, and its last neighbor, respectively (its middle neighbor is in Cj). 
Edges of G — F — L can belong to O(x) only via paths that start with the first or 
middle edge involving x and avoid F U L, so they belong to C; or C2. Similarly, 
edges of G — F ~ L belonging to /(x) appear only in C, or C3. The edges of 
the remaining components, of which there are at least two, are useless to x 
(G — F — L has no isolated vertex), but Claim 3 allows only d(x) — 3 = 0 edges 
useless to x. s 


In practical applications, we might wish to minimize the total length of 
the messages or the total time (assuming that each vertex participates in at 
most one call per time unit). We can also restrict the pairs that are allowed to 
call each other. Gossiping can be completed in 2n — 4 if and only if the graph 
of allowable calls is connected and has a 4-cycle (Bumby [1981], Kleitman- 
Shearer [1980]). Other variations congider digraphs (Exercises 15-16), fault- 
tolerance, conference calls, etc. 2 


LIST COLORING AND CHOOSABILITY 


List coloring is a more general version of the vertex coloring problem. We 
still pick a single color for each vertex, but the set of colors available at each 
vertex may be restricted. This model was introduced independently in Vizing 
[1976] and Erdés—Rubin—Taylor [1979]. 


8.4.24. Definition. For each vertex v in a graph G, let L(v) denote a list of 
colors available at v. A list coloring or choice function is a proper col- 
oring f such that f(v) € L(v) for all v. A graph G is k-choosable or list 
k-colorable if every assignment of k-element lists to the vertices permits 
a proper list coloring. The list chromatic number, choice number, or 
choosability y;(G) is the minimum k such that G is k-choosable. 


Since the lists could be identical, x,(G) > x(G). If the lists have size at 
least 1+ A(G), then coloring the vertices in succession leaves an available color 
at each vertex. This argument is analogous to the greedy coloring algorithm 
and proves that x;(G) < 1+ A(G) (see Exercise 22 for other analogues with 
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x(G)). It is not possible, however, to place an upper bound on x;(G) in terms of 
x(G); there are bipartite graphs with arbitrarily large list chromatic number. 


8.4.25. Proposition. (Erdés—Rubin—Taylor [1979]) If m = ( le then Kin.m is 
not k-choosable. 


Proof: Let X, Y be the bipartition of G = Ky». Assign the distinct k-subsets 
of [2k — 1] as the lists for the vertices of X, and do the same for Y. Consider a 
choice function f. If f uses fewer than k distinct choices in X, then there is a k- 
set § C [2k — 1] not used, which means that no color was chosen for the vertex 
of X having S as its list. If f uses at least k colors on vertices of X, then there 
is a k-set S C [2k — 1] of colors used in X, and no color can be properly chosen 
for the vertex of Y with list S. a 


List chromatic number is more difficult to compute than chromatic num- 
ber, because the statements of the upper bound and lower bound both involve 
universal quantifiers. Determining the 3-choosable complete bipartite graphs 
was difficult. For 3 < m <n, Km, is 3-choosable if and only if 


m = 3 andn < 26 (Erdés—Rubin—Taylor [1979]), or 

m = 4andn < 20 (Mahadev—Roberts—Santhanakrishnan [1991]), or 
m = 5 andn < 12 (Shende—-Tesman [1994]), or 

m = 6 andn < 10 (O’Donnell [1995)). 


Alon and Tarsi [1992] used a polynomial associated with a graph to obtain 
upper bounds on x;(G) (see also Alon {1993]). Fleischner and Stiebitz [1992] 
used the technique to prove that adding n disjoint triangles to a 3n-cycle yields 
a 3-colorable graph; they proved the stronger result that it is 3-choosable. 

There is also an edge-coloring variant, where we assign lists to the edges 
and must choose a proper edge-coloring. 


8.4.26. Definition. Let L(e) denote the list of colors available for e. A list 
edge-coloring is a proper edge-coloring f with f(e) chosen from L(e) for 
each e. The edge-choosability x;(G) is the minimum k such that every 
assignment of lists of size k yields a proper list edge-coloring. Equivalently, 
xXi(G) = x:(L(G)), where L(G) is the line graph of G. 


The argument for x’(G) < 2A(G) — 1 also yields x/(G) < 2A(G) — 1 (Ex- 
ercise 22) and thus x;(G) < 2x’(G). As in ordinary coloring, the use of line 
graphs expresses the edge version as a special case of the vertex version, and 
it behaves much better. Even so, the conjectured bound for edge-choosability 
is surprising. It was suggested independently by many researchers, includ- 
ing Vizing, Gupta, Albertson, Collins, and Tucker, and it seems to have been 
published first in Bollobas—Harris [1985] (see also Bollobas [1986]). 


8.4.27. Conjecture. (List Coloring Conjecture) x/(G) = x'(G) for all G. a 


For simple graphs, this conjecture and Vizing’s Theorem (Theorem 7.1.10) 
would yield x;(G) < A(G) +1. Bollobas and Harris [1985] proved that x;(G) < 


410 ‘ Chapter 8: Additional Topics 


cA(G) when c > 11/6 for sufficiently large A(G). This and later improvements 
used probabilistic methods. Kahn [1996] proved the conjecture asymptotically: 
x(G) < (1+ 0(1))A(G). Haggkvist and Janssen [1997] sharpened the error 
term: x;(G) < d+ O(d2/3 /logd) when d = A(G). Molloy and Reed [1999] 
further sharpened (and generalized) the bound. 

The special case of the List Coloring Conjecture for G = K,,,, was posed by 
Dinitz in 1979. (Janssen [1993] proved it for K,,,_1.) The Dinitz Conjecture 
became popular in its matrix formulation: If each position of an n by n grid 
contains a set of size n, then it is possible to choose one element from each set 
so that the elements chosen in each row are distinct and the elements chosen 
in each column are distinct. 

Galvin [1995] proved the List Coloring Conjecture for bipartite graphs, 
which includes the Dinitz Conjecture (see also Slivnik [1996]). Here we prove 
only the Dinitz Conjecture, using the Stable Matching Problem (Section 3.2). 


8.4.28. Definition. A kernel of a digraph is an independent set S having a 
successor of every vertex outside S. A digraph is kernel-perfect if every 
induced subdigraph has a kernel. Given a function f: V(G) — N, the 
graph G is f-choosable if a proper coloring can be chosen from the lists 
at the vertices whenever |L(x)| = f (x) for each x. 


h_bo¢ 


We used the concept of “kernel” in Application 1.4.14 (digraphs without 
odd cycles, for example, have kernels). An f-choosable graph is k-choosable for 
k = max f(x), since adding colors toa ligt cannot make the choice more ditficult. 


8.4.29. Lemma. (endy-Boppana-SigveD If D is a kernel-perfect orientation 
of G and f(x) = 1+ d(x) for all x € V(G), then G is f-choosable. 


Proof: We use induction on n(G); the statement is trivial for n(G) = 1. For 
n(G) > 1, consider an assignment of lists, with the list L(x) having size f(x). 
Choose a color c appearing in some list. Let U = {u: c € L(v)}. Let S be the 
kernel of the induced subdigraph D[U]. Assign color c to all of S, which is 
permissible since S is independent. 

Delete c from L(v) for each v € U — S. Delete additional colors arbitrarily 
from other lists to reduce L(x) for each x € V(D)—S to size f’(x), where f’(x) = 
1+ _,(x). Since each vertex not in S has a successor in S, we have f’(x) < 
f(x) for x € V(D) — S, which accommodates the deletion of c from the lists. 
By the induction hypothesis, D’ is f’(x)-choosable, so we can complete a list 
coloring for G by adding a list coloring of D’ to the use of c on S. a 


8.4.30. Theorem. (Galvin [1995]) x/(Ki.n) =n. 


Proof: Since x;/(G) = x:(L(G)), it suffices by Lemma 8.4.29 to prove that L(K,,,) 
has a kernel-perfect orientation with each vertex having indegree and outde- 
gree n — 1. The graph L(K,,,,) is the cartesian product K,,0 K,, (Exercise 7.1.8); 
placed in an n by n grid, vertices are adjacent if and only if they are in the same 
row or in the same column. 
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Assign labels 1, 2,..., So that vertex (r, s) has label r-+s—1 mod rn. Define 
an orientation D of K, 0 K, by directing edges from vertex (r,s) with label i to 
the vertices in column s with lower labels and the vertices in row r with higher 
labels. Since i is higher than i — 1 other labels, (7, s) has i — 1 successors in-its 
column and n — i successors in its row. Hence dt (r,s) = d7(r,s) =n —1. 

We prove that D is kernel-perfect. Given U C V(D), we obtain a kernel 
for the subdigraph D[U] by solving a stable matching problem. When (r, b) € 
U and (r,s) — (r,b) in D, we want r to prefer b to s. Thus for row r, the 
preferences among columns begin with {s: (r,s) € U} in decreasing order of 
vertex labels, followed by any order among {s: (r,s) ¢ U}. Similarly, for column 
s, the preferences among rows begin with {r: (r,s) € U} in increasing order of 
vertex labels, followed by any order among {r: (r,s) ¢ U}. 

The Gale—-Shapley Proposal Algorithm (Algorithm 3.2.17) yields a stable 
matching M for these preferences. Viewing the matched pairs in M as positions 
in the grid, let S = MU. Because M is a matching, S has no two positions 
in the same row or column; hence S is an independent set in D. We show that 
each x € U — S has a successor in S. 

Let i be the label of position x = (r,s) « U — S. Since S = MNU, we have 
x ¢ M. Thus M has a position y = (r, b) with some label j and a position z = 
(a, s) with some label k. Because M is stable, we cannot have both r preferring 
s tob ands preferring r to a. From this‘statement we deduce by the steps below 
that x has y or zasasuccessorinS. <= a 


m.me/math 


not [(r prefers s to b) and (s prefers r to a)] 
not [(y¢U or i> jf) and (¢¢U or i <k)] 
(yéU andi <j) or (cE€U and i>k) 

(x > yé S) or (x > zeES) 


8.4.31. Remark. The List Coloring Conjecture relates to another conjecture. 
A total coloring of G assigns a color to each vertex and to each edge so that 
colored objects have different colors when they are adjacent or incident. The 
Total Coloring Conjecture (Behzad [1965]) states that every simple graph G 
has a total coloring with at most A(G) + 2 colors. Rosenfeld [1971] and Behzad 
[1971] provide results on special classes. The List Coloring Conjecture would 
yield an upper bound of A(G) + 3, since every graph G has a total coloring with 
at most x;(G) + 2 colors (Exercise 25). a 


The List Coloring Conjecture has been studied for planar graphs. Elling- 
ham and Goddyn [1996] proved that every k-regular k-edge-colorable planar 
graph is k-edge-choosable (using the Four Color Theorem). 

The discussion of planar graphs brings us back to list coloring of vertices. 
Although planar graphs have chromatic number at most 4, Vizing [1976] and 
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Erdés—Rubin—Taylor [1979] conjectured that the maximum choice number on 
this class is 5. Voigt [1993] constructed a non-4-choosable planar graph with 
238 vertices; Mirzakhani [1996] (Exercise 26) reduced this to 63 vertices (both 
examples generalize to infinite families). In fact, there are 3-colorable planar 
graphs that are not 4-choosable (Gutner [1996], Voigt—Wirth [1997]). 

Thomassen [1994b] proved the upper bound (and also [1995] that planar 
graphs of girth 5 are 3-choosable). Often in inductive proofs for planar graphs, 
the vertices on the unbounded face (“external vertices”) play a special role. 


8.4.32. Theorem. (Thomassen [1994b]) Planar graphs are 5-choosable. 


Proof: Adding edges cannot reduce the list chromatic number, so we may re- 
strict our attention to plane graphs where the outer face is a cycle and every 
bounded face is a triangle. By induction on n(G), we prove the stronger result 
that a coloring can be chosen even when two adjacent external vertices have 
distinct lists of size 1 and the other external vertices have lists of size 3. For 
the basis step (n = 3), a color remains available for the third vertex. 

Now consider n > 3. Let v,, v1 be the vertices with fixed colors on the 
external cycle C. Let v1, ...,v, be V(C) in clockwise order. 

Case 1: C has a chord v;v; with 1 <i < j -2 < p-—2. We apply the 
induction hypothesis to the graph consisting of the cycle v,..., vj, vj,.-., Up 
and its interior. This selects a proper coloring in which v,, v; receive some fixed 
colors. Next we apply the induction hypothesis to the graph consisting of the 
cycle v;, vj41,..., vj and its interior to complete the list coloring of G. 

‘Case 2: C has no chord. Let v;, U1,...,Um, v3 be the neighbors of v2 in order 
(3 = p is possible). Because bounded Paces are triangles, G contains the path 
P with vertices v), u1,...,Um, v3. Since C is chordless, uj,...,u,, are internal 
vertices, and the outer face of G’ = G — ve is bounded by a cycle C’ in which P 
replaces v1, v2, v3. 

Let c be the color assigned to v;. Since |L(v2)| = 3, we may choose distinct 
colors x, y € L(v2) — {c}. We reserve x, y for possible use on v2 by forbidding x, y 
from uy,...,Um. Since |L(u;)| > 5, we have |L(u;) — {x, y}| => 3. Hence we can 
apply the induction hypothesis to G’, with uj, ..., u» having lists of size at least 
3 and other vertices having the same lists as in G. In the resulting coloring, 
vy and uy,...,Um have colors outside {x, y}. We extend this coloring to G by 
choosing for ve a color in {x, y} that does not appear on v3 in the coloring of G’. & 


Case 1 Case 2 
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PARTITIONS USING PATHS AND CYCLES 


We have considered the F-decomposition problem: partitioning E(G) 
into the minimum number of subgraphs in a family F. This has been studied 
for many families F, such as cliques (Theorem 8.4.3), bipartite graphs (Exercise 
3), complete bipartite graphs (Theorem 8.6.20), stars (vertex cover number— 
Section 3.1), and forests (arboricity—Corollary 8.2.57). Before considering ex- 
tremal problems for decomposition of graphs into paths and cycles, we discuss 
an easier problem: covering the vertices of a digraph using the fewest paths. 

Comparability graphs are those having transitive orientations; a digraph 
is transitive if x — y and y > z imply x — z. The vertices of a path in 
a transitive digraph induce a tournament. Comparability graphs are perfect 
(Proposition 5.3.25), meaning that a transitive digraph D in which the largest 
tournament has w vertices can be properly w-colored. By the Perfect Graph 
Theorem (Theorem 8.1.6), we also know that V(D) can be covered using a(D) 
tournaments in D, where a(D) is the maximum size of an independent set. 

Letting paths be “chains” and independent sets be “antichains”, this be- 
comes Dilworth’s Theorem for transitive loopless digraphs: The maximum size 
of an antichain equals the minimum number of chains needed to partition V(D). 
In addition to following from the Perfect Graph Theorem, Dilworth’s Theorem 
is equivalent to the Kénig—Egervary Theorem (Exercise 27), and a generaliza- 
tion of it follows from the Matroid Intersection Theorem (Exercise 8.2.50). Here 
we present a further generalization that has a short and self-contained proof. 


8.4.33. Theorem. (Gallai—Milgram [1960]) The vertices of a digraph D can be 
covered using at most a(D) pairwise disjoint paths. 


Proof: Since V(D) can be covered using n disjoint paths of length 0, it suffices 
to prove a stronger claim: If C is a set of pairwise disjoint paths covering V(D), 
and S is the set of sources (initial vertices) of these paths, then V(D) can be 
covered using at most a(D) pairwise disjoint paths with sources in S. The proof 
is by induction on n(D), with a trivial basis step for n(D) = 1. The added 
statement about the sources helps the induction step work. 

Suppose that n > 1 and that C is a covering of V(D) by k paths with source 
set S. The claim holds unless |C| = k > a(D), in which case we construct a 
cover using fewer paths, all with sources in S. Since k > a, there exists an 
edge xy with x, y € S. Let A and B be the paths in C starting with x and y, 
respectively. We mty assume that A has an edge xz, else we could add x to the 
beginning of B and save one path. 

By deleting x from the start of A, we obtain a cover C’ of V(D — x) by k 
paths having sources in S’ = § —x +z. Since a(D — x) < a(D), the induction 
hypothesis yields a cover C” of V(D — x) using fewer than k paths, all with 
sources in 5S’. All elements of S’ belong to S except z. 

If z is the source of a path in C”, then we add x at the beginning of that 
path. If z is not a source but y is, then we add x at the beginning of the path 
starting with y. If neither y nor z is a source, then at most |S’| -2 = k —2 paths 
have been used. and we can add x as a path by itself to obtain the desired cover 
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of V(D) using k — 1 paths. In all cases, the resulting paths are pairwise disjoint 
and have sources in S. 

By repeating this argument as long as k > a, we can reduce the number of 
paths to a. | 


We return to the decomposition problem. Gallai conjectured that every n- 
vertex graph can be decomposed using [n/2] paths. Equality holds for cliques 
(Exercise 28). Other graphs have fewer edges, but the lack of connections could 
require more paths. Hajés conjectured analogously that an n-vertex even graph 
can be decomposed into [n/2| cycles. Both conjectures remain open, but Lovasz 
proved the optimal bound when both paths and cycles are allowed. The size of 
a decomposition is the number of subgraphs used. 


8.4.34, Theorem. (Lovasz [1968b]) Every n-vertex graph can be decomposed 
into [n/2| paths and cycles. g 


Proof: Let F be the family of all paths-and cycles, and let n'(G) be the number 
of non-isolated vertices in a graph G. By induction on A(G) = 2e(G) — n'(G), we 
prove that G has an F-decomposition of size at most |n'(G) /2 | . Each component 
of G with more than one edge contributes positively to 4(G). Hence 4(G) > 0, 
with equality only when each nontrivial component is an edge. The claim holds 
with equality when A(G) = 0. 

In the induction step, A(G) > 0. We consider two cases. Case 1: If G has a 
vertex y of positive even degree, choose x € N(y), and let W = {z € N(x): d(z) is 
even}. In this case, let G’ = G — {xz: z € W}. In obtaining G’, we lose at least 
one edge (xy) and we isolate at most one vertex (x), so A(G’) < A(G). Case 2: If 
G has no vertex of positive even degree, then A(G) > 0 forces A(G) > 1. Let x 
be a vertex of degree at least 3, and form G* by introducing a new vertex y to 
subdivide an edge xx’. Let W = {y}, and let G’ = Gt — xy. Now e(G’) = e(G), 
but n'(G’) > n'(G), 80 A(G’) < A(G). 

In each case, the induction hypothesis yields an F-decomposition D of G’ 
with |D| < [n’(G’) 12 ie We convert D into an F-decomposition of size |D| for 
the graph H obtained from G’ by adding edges from x to W. In Case 1, H =G 
and n'(G’) < n'(G), so this is the desired decomposition. In Case 2, H = Gt 
and n'(G’) = n'(G*). Since n'(G) is even, |n'(G)/2| = |n'(G*t)/2|. In an F- 
decomposition of G*, the n'(G) vertices of odd degree must all be endpoints of 
paths; thus the added vertex y of degree 2 cannot be the end of a path. This 
means that xy and yx’ belong to the same subgraph and can be replaced by xx’ 
to obtain the desired decomposition of G. 

The two cases now combine; we need only obtain the decomposition of H 
from D. Let U = Ny(x). Every vertex of U has odd degree in G’, so for each 
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u € U there is a path P(u) in D with endpoint u. For u € W, we would like to 
extend P(u) to absorb ux. This cannot be done if P(u) reaches but does not end 
at x, since then the subgraph P(u) Uux is not in F. The idea is to cut the edge 
u’x on which P(u) reaches x, use the path P(u) Uux — u’x, and use u’x to extend 
P(u’) instead. This generates a sequence of changes from each u € W. We must 
show that the sequences terminate and do not conflict with each other. 


Let W = w1,..., w,. For w; € W, we form a list u?, u?,--- with u? = w; and 
each u/ € U. If in the ith list we have chosen a vertex u/, we check whether x 
is an internal vertex of P(u! ). If not, then we stop and do not define ul +1 Tf so, 
then we set u/** to be the vertex on P(u’) just before x; this is the “u’” suggested 
above. The path P(u! ) Ha j = 1 cannot start along the edge u; /x, because that 
edge is internal to P(uj~ 4) (Our picture of G’ shows three successive paths: 
P(u®) solid, P(uj) dashed, P(u?) dotted.) 

We prove next that no vertex of U: appears twice in the lists. Since xu! E 
E(G’) if j > 1, the vertices of W appear only as initial vertices. Let u! ui be a 
repeated vertex bi min{ j, J} 1} minimal, we have shown that j,/ > 0. By min- 
imality, u/~ # u'-!, and hence the paths P(u/~*) and Puy 1) start at distinct 
vertices. If u’ = ul, then the two paths share the edge Ui; ie x and “— be the 
same path. This happens from distinct vertices only if u/ ’ and ul! - 1 are op- 
posite ends of the path, but then they cannot both visit u! before x. Hence no 
repetition occurs. 

Let W’ = {u? }. Ifu= ui ' and u is not the end of its list, let u’ = = y! . We 
define an F-decomposition of G consisting of one path or cycle Q’ een 
to each Q € D. If Q # P(u) for some u € W’, let Q’ = Q. If Q = Pu), let 
Q' = Q+ux or QO’ = Q + ux — u'x depending on whether u is or is not the last 
vertex in its list. Always Q’ is a path, except that Q’ is a cycle when Q ends 
at x (and then uv’ is not defined). The union of the new paths corresponding to 
{P(u?)} is the same as UP(u/), except that the edges {xw;} are absorbed. Since 
u € W’ appears only once in the lists, the edge ux winds up in only one of the 
new paths, and {Q’: Q € D} is a decomposition of H. a 


“o./ 


Note that in this proof Q may be the selected path from each of its end- 
points u,v € W’. This is not a problem, because the adjustments to Q made 
from the two ends do not conflict. The path may visit x (thus defining wv’ and v’) 
or not, as sketched below. 
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u u’ v’ vd => u u! vu’ v 
x 4 
CIRCUMFERENCE 


When a sufficient condition for Hamiltonian cycles fails slightly, we might 
expect that the graph still must have a fairly long cycle. The length of the 
longest cycle in G is the circumference c(G). We first consider the number of 
edges needed to force a cycle of length at least c in an n-vertex graph. In this 
section, P(v, w) denotes the v, w-portion of a path P containing v and w. Also 
P, Q denotes the concatenation of paths P and Q when the last vertex of P is 
the first vertex of Q@. 


8.4.35. Theorem. (Erdés—Gallai [1959]) For m > 2, every simple n-vertex 
graph with more than m(n — 1)/2 edges has a cycle of length more than m. 


Proof: (Woodall [1972]) We use induction on n for fixed m. When n = m +1, 
fewer than (n — 1)/2 edges are missing, so 5(G) > n/2 and G is Hamiltonian. 
Suppose that n > m+1 and c(G) < m. If d(x) < m/2, then e(G—x) > m(n—2)/2. 
Applying the induction hypothesis to G — x yields c(G — x) > m. Hence we may 
assume that 5(G) > m/2. Similarly, we may assume that G is connected. 

Among all longest paths in G, choose P = v1,..., v; to maximize the degree 
d of v1; since G is connected, we have v; # v; (otherwise an edge from V(P) to 
V(G) — V(P) would yield a longer cycle). Let W = {v;: v1 <— v;41}. All neighbors 
of v1 lie on P, so |W| =d. For u, € W, the path P (vx, v1), V1 Uk41; P(vp41; v;) 
also has length /; hence N(uy,) © V(P), and the choice of P yields d(v,) < d. 
Furthermore, no y,, € W has a neighbor v; such that j > m, because then we 
could complete the long cycle by adding v;v, to P( vz, v1), V1 ¥p41, P(ve41, v;). 

By limiting the edges incident to W, we force many edges into G — W. 
Let Z = {v1,...,u,}, where r = min{l,m}. For each uy € W, we have shown 
that N(v;,) € Z. Hence there are |[W, Z — W]| + e(G[W]) edges incident to W. 
For fixed degree-sum in W, this is maximized when [W, Z — W] is a complete 
bipartite graph. We further maximize by letting each vertex of W have degree d. 
The resulting count is = 5 |W| (d+|Z — W|) = dr/2 < dm/2. Therefore, G— W has 
n — d vertices and mitre than m(n —d — 1)/2 edges. By the induction hypothesis 
c(G — W) > m. (If the number of edges forced into G — W is too large to exist, 
then this case cannot occur, and an earlier case applies.) a 


v1 Uk -Uk41 Ul 
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Most sufficient conditions for Hamiltonian cycles have “long cycle” ver- 
sions. The long cycle version of Dirac’s Theorem says that a 2-connected graph 
G has a cycle of length at least min{n(G), 25(G)} (Dirac [1952b]). Requiring 
2-connectedness eliminates the example K; V 2K; with circumference 6 + 1. 

The long cycle version of Ore’s Theorem [1960] came much later. It is im- 
plicit in Bondy [1971b] and was made explicit in Bermond [1976] and in Linial 
[1976]. The fundamental argument used in many long cycle results appears 
in Bondy [1971b]. It strengthens the Ore/Dirac switching argument (Theorem 
7.2.8) by considering “gaps”. 


8.4.36. Lemma. (Bondy [1971b]) If P = v,...,v, is a longest path in a 2- 
connected graph G, then c(G) > min{n(G), d(v1) + d(y)}. 

Proof: (See also Linial [1976]). Let m = d(v,) + d(v;), and suppose that c(G) < 

min{n(G), m}. Since G is connected, an /-cycle would yield a longer path; thus 

v1 # y. If vy <> v; and v; + v, for somei < j, then i, j is a crossover with gap 

j —i. Ifwe add v1; and wv; to P(j,1) and P(i, 1), we obtain a cycle with length 

1—(j -—i-—1). Hence! — (j —i — 1) < m wheni, j is a crossover. 


x =v] U; Vj y=y 


Thus x and y have no neighbors between v; and v; on P. Also N(y) contains 
no predecessor on P of a neighbor of x, since an /-cycle yields a longer path. 
Hence N(y) lies in V(P) — {y} but avoids {vj41,..., vj-2} and {v,-1 : u, © x}. 
Thus d(y) < (J — 1) — (j -2—i) — d@). Since / - (j —i — 1) < m, we have 
d(x) +d(y) < m, which contradicts the hypothesis. Hence there is no crossover. 

With t9 = max{i : x <= v;} and u = min{i : y = v;}, we have proved that 
to <u. We will construct a cycle containing x and y and all their neighbors. 
Since the absence of crossovers implies that |N(x) N N(y)| < 1, such a cycle has 
length at least d(x) +d(y)+1>m. 

We iteratively define paths P,, Po,---. Given t;_1, we choose integers s; < 
tj-1 < t; to maximize t; such that G has a v,,, v;,-path P; internally disjoint from 
P. Such a path exists because G — v,_; is connected. These paths are disjoint; 
if P; shares a vertex with a later path P;, then we can choose P; as an 5;, t;-path, 
which contradicts the maximality of t;. Similarly, s;,; > t;-1, since otherwise 
P;,, would be chosen instead of P,;. 

Let r be the smallest index such that t, > u. Set 


a=min{j:x <@ v; andj > 53}, b = max{j: y  v; and j <1t,}. 


Since 5s; < fo and t, > u, the indices a, b are well-defined. We use the even- 
indexed paths P; to build one x, y-path and the odd-indexed paths to build 
another x, y-path. When r is odd, the two paths are formed by the following 
concatenations. 


XUq, P(a, $2), Po, P(t, $4); P4, sees Pig, b), Upy 


P(1, 51), Pi, P(t, 53), Ps, P(ts, 55),...5 P-, P(t,, 0) 
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P P,_1 
a t Sg to P1 ub 
Oe 
P, P3 P, 


When r is even, the path starting with xv, reaches t, and ends with P(t,, 1), 
while the other path reaches v, and ends with wy. 
We have observed that s;,; > t;_;. Hence 


$1 <A@<t9 < SQ <0) < 53 < to: <t_y<u<b<t, 


This implies that the two concatenations described are paths and that their 
union is a cycle. By the definition of a, we have N(x) C P(1, 51) U P(a, to), and 
similarly N(y) C P(u, b) U P(t,, 1). With x and y themselves, the cycle thus has 
length at least 2+ d(x) +d(y)-—1>~m. a 


Ore proved that G is Hamiltonian if d(u) + d(v) > n(G) when u # v. 
Bondy’s Lemma implies the long cycle version of this, which strengthens the 
long cycle version of Dirac’s Theorem. 


oks 


8.4.37. Theorem. (Bondy [1971b], Bermond [1976], Linial [1976]) If G is 2- 
connected and d(v) + d(v) > s for every nonadjacent pair u, v € V(G), then 
c(G) > min{n(G), s}. 


Proof: Ore’s Theorem guarantees a Hamiltonian cycle if s > n, so we may 
assume that s < n. Suppose that P is a longest path in G, with endpoints x 
and y. Since G is connected, the maximality of P implies that x 4 y. Now the 


condition d(x) + d(y) > s allows us to invoke Lemma 8.4.36. a 


1.me/n 


Bermond extended this to a “long cycle” combination of Chvatal’s condition 
and Las Vergnas’ condition. The technique of edge-switches involving an end- 
point of a longest path was used in Theorem 8.4.35. Our statement is slightly 
weaker than that of Bermond but has a simpler proof. 


8.4.38. Theorem. (Bermond [1976]) Let G be a 2-connected graph with degree 
sequence d, < --. <d,. If G has no nonadjacent pair x, y with degrees /, j 
such that d; <i <c/2,dj,; < j, andi+ j <c,thenc(G) >c. 


Proof: Among the longest paths in G, let P = v1,..., v; with endpoints x = v,; 
and y = v; be chosen to maximize d(v1) +d(v). If d(x) +d(y) > c, then we apply 
Bondy’s Lemma. If d(x) + d(y) < c, then we claim that x, y contradicts the hy- 
potheses. As usual, an /-cycle would yield a longer path (since G is connected), 
so x # y. We may assume that d(x) < d(y) and set i = d(x) and j = d(y). 

All neighbors of x and y lie in P. Ifx = 1%, then P(v,_1, x), xvz, P(v%, y) 
is another longest path ending at y; thus d(vj_1) < d(x) = i, by the choice of 
P., Since this holds for each of the i neighbors of x, we have d; < i. Similarly, 
the j neighbors of y each have degree at most j. Also d(x) < j, so dj; < j. By 
hypothesis, i + j = d(x) + d(y) < c, which completes the contradiction. a 
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G.-H. Fan [1984] strengthened Theorem 8.4.37 by weakening the degree 
condition and by requiring it only for nonadjacent pairs with common neigh- 
bors. T. Feng [1988] used Bondy’s Lemma to shorten the proof. The result 
includes a sufficient condition for Hamiltonian cycles that does not require the 
closure to be complete. 


8.4.39. Example. A Hamiltonian graph. For even n, let Gj = Kn/2 and Gz = 
(n/4) Ke, and form G by adding a matching between disjoint copies of G; and Gz. 
The Hamiltonian closure of G is G itself, so our previous sufficient conditions do 
not apply. Even though G has n/2 vertices of degree 2, Fan’s Theorem implies 
that G is Hamiltonian. a 


8.4.40. Theorem. (Fan [1984]) If G is 2-connected, and dg(u, v) = 2 implies 
max{d(u), d(v)} > c/2, then c(G) > min{n(G), c}. 


Proof: (Feng [1988]) Let U = {v € V(G): d(v) = c/2}. By Bondy’s Lemma, it 
suffices to find a longest path having both endpoints in U. Among the paths 
of maximum length, let P = vj, ..-, ¥,, be one that has the maximum number 
of endpoints in U. If P fails to have both endpoints in U, then we will find a 
longer path or a path of the same length with more of its endpoints in U. We 
may assume that v; ¢ U. 

Since d(v) < c/2 for all v ¢ U, the hypothesis on pairs with distance 2 
implies that G — U is a disjoint union of complete graphs. Let Y be the one 
containing v;. Let X be the set of vertices in U having neighbors in Y. By the 
hypotheses, vertices of X have neighbors only in Y UU. Also |X| > 2, because 
G is 2-connected. 

Let r = |Y|. We first show that P begins by visiting all of Y. If P omits 
some vertex of Y, then we can absorb it before the first exit from Y. If P leaves 
and returns to Y, then it returns via an edge xy. Because G[Y] is complete, we 
can replace xy in P with v,y, obtaining an x, v,,-path having the same length 
as P but more endpoints in U. Hence we may assume that Y = {v1,..., v,}. 


Consider x € X — v,41. Suppose first that x has a neighbor y € Y other 
than the exit vertex v, of P. If x ¢ V(P), then-we can instead start with xy, 
absorb the rest of Y up to v,, and thus complete an x, v;-path longer than P. If 
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x € V(P), then we let x’ be the vertex before x on P. Since x ¥ v,11, we have 
x’ € U. We replace x’x in P with yx, obtaining an x’, 1-path with the same 
length as P but more endpoints in U. 


Hence we may assume for x € X — v,4; that x has no neighbor in Y other 
than v,. If |Y| > 2, this makes v, a cut-vertex unless v,4; has another neigh- 


bor y € Y —v,. Now we rearrange P to start with v,,..., v, ¥,41 instead of 
V1,..., Uy, Uy41- This puts us in the case just discussed. 

The remaining case is |Y| = 1 and N(v1) = X. With x € X — u,4, as before, 
we append x to the beginning of P or replace x’x with xv. | 


Finally, we present one result about digraphs that strengthens Ghouila- 
Houri’s sufficient condition (Theorem 7.2.22) for Hamiltonian cycles. We con- 
sider only loopless digraphs naving at. most one copy of each ordered pair as 
an edge; call these strict digraphs. For digraphs, we use “u, v nonadjacent” to 
mean uv, vu ¢ E(G). Also, we define d(v) = d*(v) + d~(v). 

Ghouila-Houri [1960] actually proved that a digraph G is Hamiltonian if 
d(v) > n(G) for each v; this is stronger than Theorem 7.2.22 as stated. Woodall 
[1972] proved that it suffices to have d*(u) + d~(v) > n(G) whenever u,v are 
nonadjacent. This generalizes Ore’s Theorem for undirected graphs (Exercise 
33). Meyniel [1973] proved that a strict strong digraph G is Hamiltonian if 
d(u)+d(v) > 2n(G)—1 for all nonadjacent pairs u, v. Meyniel’s Theorem implies 
Ghouila-Houri’s Theorem and Woodall’s Theorem (Exercise 33). 


8.4.41. Example. Meyniel’s Theorem is best possible. Let G consist of two 
doubly-directed cliques sharing a vertex. The digraph is strongly connected, 
and the only pairs of nonadjacent vertices consist of one vertex from each clique. 
If the cliques have order k and order n + 1 — k, then the total degrecs for any 
nonadjacent pair are 2k — 2 and 2n — 2k, which sum to 2” — 2. | 


8.4.42. Theorem. (Meyniel [1973]) If G 1s a strict strongly connected digraph 
such that d(u) + d(v) > 2n — 1 whenever uy, v are distinct nonadjacent ver- 
tices, then G is Hamiltonian. 


Proof: (Bondy—-Thomassen [1977]). We prove a technical lemma: if T = 
U1..-., Uy is a path that cannot absorb the vertex v internally (between two 
of its vertices), then the number of edges from v to T plus the number of edges 
from T to v is at most k+1. This follows by counting. Fer 1 < i < k—1, only one 
of the edges v;v and vu;,; is permitted. Also vv; and u,v are permitted; there is 
no restriction on absorption at the end. 
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We use this to prove the following statement: If G is a strict strong non- 
Hamiltonian digraph, and S is a maximal vertex subset having a spanning 
cycle (x1,...,%m) in G, then there exist v € S and integers a,b withl<a<m 
and 1 < b < m such that (1) x,v € E(G), (2) v is not adjacent to any x,4; with 
1 <i <b, and (8) d(v) +. d(xg4,) < 2n —1—b. Since b > 1, the conclusion of this 
statement is impossible under the hypothesis of the theorem, which will imply 
that the only maximal vertex set having a spanning cycle is V(G). 

Suppose first that no path leaves § and returns to it. Since G is strong and 
S 4 V(G), some cycle C of length at least 2 shares exactly one vertex with S. 
Let this vertex be x,, and let v be the successor of x, on C. By the path condition, 
there is no path between v and S — {x,} in either direction. In particular, each 
vertex outside S U {v} is incident to at most two edges also incident to v or 
Va41- Furthermore, v is incident to at most two edges also incident to S (the 
other endpoint must be v,). Finally, each vertex of S — vg; is incident to at 
most two edges also incident to vg,1. Summing the allowed contributions yields 
d(v) + d(%q41) < 2n — 2. Hence the desired condition holds with b = 1. 


elegram.me/math_books 


Now suppose that some path leaves S and returns to it. Choose such a path 
P so that the distance c along S from the start of P to the end of P is minimal. 
Let x, be the start of P, and let v be its successor on P. The maximality of S 
implies that c > 1. Let T be the portion of S frem x,4, to xz; this has m—c+1 
vertices. The maximality of S implies that v cannot be absorbed internally by 
T. Hence our technical lemma implies that v belongs to at most m —c +2 edges 
incident to T. The minimality of c makes v nonadjacent to x241,..., X¢-1.- 

Let b be the largest integer in [c] such that G has a path from x,4, to xq 
with vertex set S — {Xain,---,Xa+c-1}. Let R be such a path (the path T with 
b = 1 implies that R exists.) Since P U R is a cycle, the maximality of S yields 
b <c. By the maximality of b, x,4, is not absorbed internally by R. Hence, by 
our technical lemma, x,+, belongs to at most m —c+b-+1 edges incident to R. 

Now we count d(v) + d(xg,). Each vertex outside S U {v} is incident to at 
most two edges also incident to {v, x,4,}, because the minimality of c prevents 
a path of length 2 between v and x,4, (in either direction) using a vertex not in 
S. We have observed that v belongs to at most m — c + 2 edges incident to S. 
We have observed that v,,, belongs to at most m —c+b-+ 2 edgcs incident to 
R. Finally, x,., belongs to at most 2(c — b — 1) edges incident to S — R. Hence 
d(v)+d(xq4n) < 2(n -—m—1)+(m—c4+2)4+(m—c+b4+1)+2(c—b—-1) = 2n—-1-b. 
Again we have obtained the desired condition. a 
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EXERCISES 


8.4.1. Let m = |n?/4|. Prove that every n-vertex graph has ani intersection represen- 
tation using subsets of [m] such that each element of [m] appears in at most three sets. 
Equivalently, every n-vertex graph decomposes into at most m edges and triangles. 


8.4.2. Prove that the following conditions on a graph G with no isolated vertices are 
equivalent. (Choudom—Parthasarathy—Ravindra [1975]) 

A) 0"(G) = a(G). 

B) 6G V G) = (6'(G))?. 

C) 6’(G) = @(G). 

D) Every clique in a minimum clique cover of E(G) uses a simplicial vertex of G. 


8.4.3. (+) Let b(G) be the minimum number of bipartite graphs needed to partition 
E(G) (called biparticity). Let a(G) denote the minimum number of classes needed to 
partition E(G) such that every cycle of G contains a non-zero even number of edges from 
some class. Prove that these parameters both equal [lg x¥(G)]. (Hint: Prove lg x(G) < 
b(G) < a(G) < [lg x(G)].) (Harary—Hsu—Miller [1977], Alon—Egawa [1985]) 


8.4.4. Determine all the n-vertex graphs that have product dimension n — 1. (Lovasz— 
NeSetril—Pultr [1980]) 


8.4.5. Prove that pdim G < 2 if and only if G is the complement of the line graph of a 
bipartite graph (Lovdsz—NeSetril—Pultr [1980]) 


8.4.6. Given r, compute pdim (K, +mK}) for all m > 1. (Lovdsz—NeSetril—Pultr [1980]) 
8.4.7. (—) Compute the product dimension of the three-dimensional cube. 


8.4.8. Obtain upper and lower bounds on the product dimension of the Petersen graph 
that differ by 1 (the upper bound will most likely be the correct value, but showing that 
it cannot be improved is tedious). _ 


8.4.9. Let f(n) be the maximum value of pdim G - pdim G over all graphs on n vertices. 
Prove that |n?/4| < f(n) < (n- 1). 


8.4.10. For n > 4, prove that pdim P, = [lg(n — 1)]. For n > 3, prove that pdim C2, = 
1+ [lg(n — 1)] and 1+ [lgn] < pdim C2,4; < 2+ [lgn]. (Lovasz—NeSetril—Pultr [1980]) 
(Comment: Evans—Fricke—Maneri—McKee—Perkel [1994] showed that pdim Con.) = 1+ 
[lgn] except possibly when n is a power of 2.) 


8.4.11. Prove that Cy.) is not isometrically embeddable in any cartesian product of 
cliques if k > 1. 


8.4.12. Determine the squashed-cube dimension of C3. 


8.4.13. (+) Determine the squashed-cube dimension of K33. (Hint: Use symmetry to 
reduce case analysis.) 


8.4.14. (!) Use Edmonds’ Branching Theorem (Theorem 8.4.20) to prove the edge version 
of Menger’s Theorem in digraphs: A’(x, y) = «'(x, y). (Hint: Devise an appropriate graph 
transformation to obtain a short proof.) 


8.4.15. (!) The gossip problem is also called the “telephone problem”, and the corre- 
sponding problem for directed graphs is called the “telegraph problem”. As a function 
of n, determine the minimum number of one-way transmissions among n people so that 
each person has a transmission path to every other. (Harary—Schwenk [1974]) 
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8.4.16. Let D be a digraph solving the telegraph problem in which each vertex receives 
information from each other vertex exactly once. Prove that in D at least n — 1 vertices 
hear their own information. For each n, construct such a D in which only n — 1 vertices 
hear their own information, but for each x + y there is exactly one increasing x, y-path. 
(Seress [1987]) 


8.4.17. The NOHO property. 

a) Let G be a connected graph with 2n — 4 edges having a linear ordering that 
solves the gossip problem and satisfies NOHO (no increasing cycle). Suppose also that 
n(G) > 8 and that at most two vertices have degree 2. Prove that the graph obtained by 
deleting the first calls and last calls of vertices in G has 4 components, of which two are 
isolated vertices and two are caterpillars having the same size. (West [1982a]) 

b) For every even n > 4, construct a connected ordered graph with 2n — 4 edges 
that satisfies the NOHO property. (Hint: Make use of the structural properties proved 
in part (a) to guide the search.) 


8.4.18. A NODUP scheme (NO DUPlicate transmission) is a connected ordered graph 
that has exactly one increasing path from each vertex to every other. 

a) (—) Prove that every NODUP scheme has the NOHO property. 

b) Prove that there is no NODUP scheme when n ¢ {6,10, 14,18}. (Comment: 
Seress [1986] proved that these are the only even values of n for which NODUP schemes 
do not exist, constructing them for all other values. For n = 4k, West [1982b] constructed 
NODUP schemes with 9n/4 — 6 calls, and Seress [1986] proved that these are optimal.) 


8.4.19. A vertex in a simple graph G wishes to broadcast information to all other ver- 
tices. In each time unit, each vertex that already knows the information can make one 
call to a neighbor that does not know the information. The time required to broadcast 
from v is the minimum number of time units in which all vertices can learn the infor- 
mation. Construct an n-vertex graph G with fewer than 2n edges such that every vertex 
of G can broadcast in time at most 1 + Ign. . (Grigni—Peleg [1991]) 


8.4.20. (!) Prove that the graph below is not 2-choosable. 


8.4.21. Prove that Kim is k-choosable if and only ifm < k* (Erdés—Rubin—Taylor [1979]) 


8.4.22. Prove that x;(G) < 1+ maxycg 6(G) and that x;(G) + x:(G) <n +1. Prove also 
that x/(G) < 2A(G) — 1. 


8.4.23. Prove that every chordal graph G is x (G)-choosable. 


8.4.24. Prove that a connected graph G has a proper list coloring from lists such that 
|L(v)| = d(v) for all v if there is strict inequality for at least one vertex. 


8.4.25. (!) Prove that G has a total coloring (Remark 8.4.31) with at most x/(G) + 2 
colors. 


8.4.26. (!) Non-4-choosable planar graph of order 63. _ 
a) In the list assignments for the graph below, 5S denotes [4] and i denotes S - {i}. 
-Prove that this graph has no proper coloring chosen from these lists. 
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b) In the list assignments for the graph G below, i denotes [5] — {i}; each list has 
size 4. Let G’ be the graph obtained from G by adding one vertex with list 1 adjacent to 
all vertices on the outside face of this drawing of G. Prove that G’ has no proper coloring 


chosen from these lists. (Mirzakhani [1996]) 


8.4.27. (!) Equivalence of Dilworth’s Theorem and Kénig-Egervary Theorem. 

a) Given a bipartite graph G, apply Dilworth’s Theorem to a transitive orientation 
of it to obtain the Konig—Egervary Theorem. 

b) Given a transitive digraph D, let G be the split of D as defined in Definition 
1.4.20. Apply the Kénig—Egervary Theorem to G to obtain Dilworth’s Theorem for D. 


n.me/matlysye 


8.4.28. (!) Prove that K, decomposes into [n/2] paths. Prove that K, decomposes into 
[n/2]| cycles when n is odd. 


8.4.29. (!) Decomposition of K, into spanning connected subgraphs. 
a) Prove that if K, decomposes into k spanning connected subgraphs, then n > 2k. 
b) Prove that Ka, decomposes into k spanning trees of diameter 3. (Hint: Let the 
central edges of these trees form a perfect matching.) (Palumbiny [1973]) 


8.4.30. Prove that every 2-edge-connected 3-regular simple planar graph decomposes 
into paths of length 3. Prove the same statement for planar triangulations. (Jiinger— 
Reinelt—Pulleyblank [1985]) 


8.4.31. Prove that Theorem 8.4.35 is best possible when m — 1 divides n — 1. 


8.4.32. Let G be a graph such that G is triangle-free and not a forest. Prove that G has 
a cycle of length at least n(G)/2. (Hint: Use Theorem 8.4.37.) (N. Graham) 


8.4.33. Use Woodall’s Theorem to prove Ore’s Theorem, and use Meyniel’s Theorem to 
prove Woodall’s Theorem. 


8.4.34. Use Meyniel’s Theorem to prove that a strict n-vertex digraph has a spanning 
path if d(u) + d(v) > 2n — 3 for every pair u, v of distinct nonadjacent vertices. 
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8.5. Random Graphs 


In its simplest form, the probabilistic method is used to prove the existence 
of desired combinatorial objects without constructing them. An appropriate 
probability model is defined on a large class of objects. The occurrence of the 
desired structure is an event. If this event has positive probability, then some 
object with the desired structure exists. Designing the model and applying 
probabilistic and asymptotic techniques may involve considerable art. 

We discuss these methods in the context of random graphs. The study of 
random graphs is itself motivated by the modeling of physical properties and 
by the analysis of algorithms in computer science. 


8.5.1. Example. Melting points. The behavior of random graphs suggests 
a mathematical explanation for melting points. Think of a solid as a three- 
dimensional grid of molecules, with neighboring molecules joined by bonds. 
For example, consider the graph P;0 PD P,,, with bonds corresponding to edges. 
Adding energy excites molecules and breaks bonds. We assume that bonds 
break at random as we raise the temperature (energy level). Each temperature 
corresponds to some fraction of bonds broken. While the graph remains largely 
connected, the material seems solid. Breaking off small pieces doesn’t change 
this, but when all the components are small the global nature of the material 
changes. Small components of ‘molecules float freely, like a liquid or gas. 
Mathematically, there is a threshold for the number of bonds to be broken 
(in terms of the size of the grid) such that almost every way of breaking some- 
what fewer bonds leaves a giant component, and almost every way of breaking 
somewhat more bonds leaves all components being tiny. Just below the thresh- 
old temperature the material will almost certainly be a solid, and just above it 
the material will almost certainly not be a solid. a 


8.5.2. Example. Analysis of algorithms. Worst-case complexity is the maxi- 
mum running time for an algorithm over all inputs of size n (see Appendix B). 
For difficult problems, we may seek an algorithm that takes many steps on a 
few bizarre graphs while running quickly on most graphs. We need a way to 
describe the usefulness of such algorithms. 

The answer is probabilistic analysis. We assume a probability distri- 
bution on the inputs and study the expected running time with respect to this 
distribution. Choosing a realistic distribution can be difficult. In practice, we 
choose a probability distribution that makes the analysis feasible. We cannot 
define a probability distribution over infinitely many graphs, so we define a 
distribution on the graphs of each order. This is consistent with viewing the 
expected running time as a function of the input size. a 
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Erdés and Rényi [1959] introduced random graphs. The subject developed 
rapidly in the 1980s, with books by Bollobas [1985], by Palmer [1985], and by 
Alon and Spencer [1992] (the last treats broader combinatorial applications of 
probabilistic methods). The book Janson—Luczak—Rucinski [2000] emphasizes 
later developments. 

More sophisticated probabilistic techniques than we can present here are 
now being applied to random graphs. We describe the basic techniques and 
suggest the flavor of the subject, with no attempt at exhaustive treatment. 


EXISTENCE AND EXPECTATION 


We begin by showing how probabilistic methods can prove existence state- 
ments. Suppose we want to prove that an object with some desired property 
exists. We define a probability space where occurrence of the desired property 
is an event A. If A has positive probability, then the desired object exists. 


8.5.3. Definition. A discrete probability space or probability model is a 
finite or countable set S together with nonnegative weights on the elements 
that sum to 1. An event is a subset of S. The probability P(A) of an 
event A is the sum of the weights of the elements of A. Events A and B are 
independent if P(A B) = P(A)P(B). 


Erdés popularized the probabilistic method in 1947 by using it to prove 
lower bounds on Ramsey numbers (Definition 8.3.6). We phrased this combi- 
natorially in Theorem 8.3.12; here we present the same proof in probabilistic 
language. It uses the observation that P(\); Ai) < >); P(Ai). Note that in this 
section, all graphs are simple. 


8.5.4. Theorem. (Erdés [1947]) If (")2'-@ < 1, then R(p, p) > n. 


Proof: It suffices to show that when (")21-@) < 1 there is an n-vertex graph G 
with w(G) < p and a(G) < p. We define a probability model on graphs with 
vertex set [n] by letting each edge appear independently with probability .5. If 
the probability of the event Q = “no p-clique or independent p-set” is positive, 
then the desired graph exists. 

Each possible p-clique occurs with probability 2~©), since obtaining the 
complete graph requires obtaining all its edges, and they occur independently. 
Hence the probability of having at least one p-clique is bounded by (")2-@), 
The same bound holds for independent p-sets. Hence the probability of “not Q” 
is bounded by (")21-@), and the given inequality guarantees that P(Q) > 0. & 


8.5.5. Remark. Existence arguments can be used as probabilistic construction 
algorithms. The probability that a random 1024-vertex graph has a 10-clique 
or independent 10-set is less than 2!" /20!. If the first random one doesn’t work, 
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generate another; the probability of continued failure is the product of these 
small numbers and soon becomes incomprehensibly small. | 


The lower bound in Theorem 8.5.4 is roughly Ja: the inductive upper 
bound in Theorem 8.3.11 is roughly 4*. The gap is large. More sophisticated 
probabilistic methods have achieved only small improvements in the lower 
bound. Nevertheless, the constructive bounds are much weaker, so this is a 
triumph for the probabilistic method. The proof is essentially just a count- 
ing afgument. Many probabilistic arguments with finite sample spaces can be 
rephrased as weighted counting arguments, but the proofs are simpler in the 
language of probability. 

The introduction of random variables adds considerable power. We assign 
values to the elements of our probability space.t We have already used the 
comparison between the average and maximum values of a random variable to 
prove inequalities. 


8.5.6. Definition. A random variable is a function assigning a real number 
to each element of a probability space. We use X = k to denote the event 
consisting of all elements where the variable X has the value k. 

The expectation F(X) of a random variable X is the weighted aver- 
age )|,kP(X =k). The pigeonhole property of the expectation is the 
statement that there exists an element of the probability space for which 
the value of X is as large as (or as small as) E(X). 


um 


Applying the pigeonhole property requires a value or bound for E(X). Of- 
ten the computation applies the linearity of expectation to an expression for 
X in terms of simpler random variables. For our purposes, we generally restrict 
our attention to probability models on finite sets and sum only finitely many 
random variables. Analogous results hold in continuous probability spaces. 


8.5.7. Lemma. (Linearity property) If X and the finite set {X;} are random 
variables on the same space and X = )- X;, then E(X) = > E(X;). Also 
E(cX) = cE(X) forc € R. 


Proof: In a discrete probability space, each element contributes the same 
amount to each side of the desired equations. | 


We often apply Lemma 8.5.7 to random variables that count substructures. 
Such a random variable is a sum of variables indicating whether one of the pos- 
sible things being counted actually occurs. These indicator variables take 
values in {0, 1} (they are also called 0, 1-variables). The expectation of an ‘in- 
dicator variable is the probability that it equals 1. These properties facilitate 
what was perhaps the first use of the probabilistic method. 


TWe consider only discrete probability spaces, but analogous concepts hold for continu- 
ous probability spaces. 
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8.5.8. Theorem. (Szele [1943]) Some n-vertex tournament has at least n!/2"-1 
Hamiltonian paths. 


Proof: Generate tournaments on [n] randomly by choosing i —> j or. j —> i 
with equal probability for each pair {i, j}. Let X be the number of Hamiltonian 
paths; X is the sum of n! indicator variables for the possible Hamiltonian paths. 
Each Hamiltonian path occurs with probability 1/2"-!, so E(X) = n!/2"-!. In 
some tournament, X is at least as large as the expectation. | 


This simple bound using expectation gives almost the right answer for 
the maximum number of Hamiltonian paths in an n-vertex tournament; Alon 
[1990] proved that it is at most n!/(2 — 0(1))”. When almost all instances have 
a value near the extreme, probabilistic arguments are especially effective. 

Many inequalities can be interpreted as statements about the expected 
value of a random variable. This often yields a shorter proof than combinatorial 
methods. Exercise 3.1.42 requests a combinatorial proof of the next result. 


8.5.9. Theorem. (Caro [1979], Wei [1981]) e(G) > reve) ari for every 
graph G. 


Proof: (Alon—Spencer [1992, p81]) Given an ordering of the vertices of G, the 
set of vertices that appear before all their neighbors form an independent set. 
When the ordering is chosen uniformaly at random, the probability that v ap- 
pears before all its neighbors is 1/(d (v)+ 1). Thus the right side of the inequality 
is the expected size of the independent set formed by choosing the vertices ap- 
pearing before their neighbors in a random vertex ordering. | 


When a randomly generated object is close to having a desired property, a 
slight alteration may produce it. This technique is called the deletion method, 
the alteration method, or the two-step method. Ramsey numbers furnish 
a classical application (Exercise 16). We provide two others. 

Recall that S C V(G) is a dominating set in G if every vertex outside S$ has 
a neighbor in S (Definition 3.1.26). When G is k-regular, every vertex dominates 
k +1 vertices (including itself), so every dominating set has at least n(G)/(k +1) 
vertices. The alteration method yields a dominating set close to that bound in 
every graph with minimum degree k. The argument, like many involving tnese 
techniques, uses the fundamental inequality 1 — p < e~” (%:xercise 2). 


8.5.10. Theorem. (Alon [1990]) Every n-vertex graph with minimum degree 
k > 1 has a dominating set of size at most n ein 


Proof: In such a graph G, select a random set § C V(G) by including each 
vertex independently with probability p = In(k + 1)/(k + 1). Given S, let T be 
the set of vertices outside S having no neighbor in S; adding T to S yields a 
dominating set. We seek the expected size of S$ UT. 

Since each vertex appears in S$ with probability p, linearity vields E(|S|) = 
np. The random variable |7| is the sum of 1 indicator variables for whether 
individual vertices belong to 7. We have v ¢€ T if and only if v and its neighbors 
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all fail to be in S. This has probability bounded by (1— p)**1, since v has degree 
at least k. Since (1 — p)*t! < e7?"+), we have E(|S| + |T|) < np +ne?*"+) = 
n tins The pigeonhole property of the expectation completes the proof. @ 


This easy bound yields almost the smallest s, such that every graph G with 
minimum degree k has a dominating set of size at most s,n(G) (Alon [1990]). A 
greedy algorithm proves the same result constructively (Theorem 3.1.30). 

A striking and famous application of the deletion method is the existence 
of graphs with large girth and chromatic number. Explicit constructions came 
much later (Lovasz [1968a], NeSetril—Rédl [1979], Kriz [1989]). We present a 
simplification of the original proof (Alon—Spencer [1992, p35]). It uses a prop- 
erty of the expectation that we will prove in Lemma 8.5.17. 


8.5.11. Theorem. (Erdés [1959]) Given m > 3 and g > 3, there exists a graph 
with girth at least g and chromatic number at least m. 


Proof: We generate graphs with vertex set [n] by letting each pair be an edge 
with probability p, independently. A graph with no large independent set has 
large chromatic number, since x(G) > n(G)/a(G). We therefore choose p large 
enough to make large independent sets unlikely. We also choose p small enough 
to make the expected number of short cycles (length less than g) small. Given 
a graph satisfying both conditions, we ean delete a vertex from each short cycle 
to obtain the desired graph. 3 


ot 
e 


To make it unlikely that we generate more than n/2 short cycles, we let 
p = n'-1, where t < 1/g. Each of the possible cycles of length j occurs with 
probability p’. Since there are njj;)/(2 j) of these for each j, the total number X 
of cycles of length less than g has expectation 


E(X) = 1873 nay pi /(2i) < 8g n"/(2). 


Since tg < 1, this implies that E(X)/n — 0 asn — oo. In Markov’s Inequality 
we will complete the details of concluding from this that P(X > n/2) > 0 as 
n— oo. Forn large enough, P(X >n/2) < 1/2. 

Since a(G) cannot grow when we delete vertices, at least (n — —X)/a(G) 
independent sets are needed to color the vertices remaining when we delete a 
vertex of each cycle. If X < n/2 and a(G) < n/(2k), then at least k colors are 
needed for the graph remaining. With r = [3lnn/p], we have 


P(a(G) =r) < (")A- p)® < [ner], 


This approaches 0 as n grows. 

Since r = [38n!' Inn| and k is fixed, we can choose n large enough to obtain 
r < n/(2k). If we also choose n large enough so that P(X > n/2) < 1/2 and 
P(a(G) > r) < 1/2, then there will exist an n-vertex graph G such that a(G) < 
n/(2k) and such that G has fewer than n/2 cycles of length less than g. We 
delete a vertex from each short cycle and retain a graph with girth at least g 
and chromatic number at least k. a 
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PROPERTIES OF ALMOST ALL GRAPHS 


We have proposed studying properties that “almost always” hold. This 
phrase has meaning in the context of a probability model. 


8.5.12. Definition. Given a sequence of probability spaces, let g, be the prob- 
ability that property Q holds in the nth space. Property Q almost always 
holds if lim,_, 40 gn = 1. 


For us, the nth space is a probability distribution over n-vertex graphs. 
When property Q almost always holds, we say “almost every graph has prop- 
erty Q”. Making all graphs with vertex set [n] equally likely is equivalent to 
letting each vertex pair appear as an edge with probability 1/2. Models where 
edges arise independently with the same probability are the most common for 
random graphs because they lead to the simplest computations. We allow this 
probability to depend on n. 


8.5.13. Definition. Model A: Given n and p = p(n), generate graphs with ver- 
tex set [n] by letting each pair be an edge with probability p, independently. 
Each graph with m edges has probability p”(1 — p)@)-™, The random vari- 
able G’ denotes a graph drawn from this probability space. “The random 
graph” means Model A with p = a? which makes all graphs with vertex 
set [n] equally likely. 


1m.me/me 


Computations are much simpler fof graphs with a fixed vertex set (“labeled” 
graphs) than for random isomorphism classes. Since inputs to algorithms are 
graphs with specified vertex sets, this model is consistent with applications. 

We often measure running times of algorithms in terms of the number 
of vertices and number of edges; hence we may want to control the number of 
edges. This suggests a model in which the n-vertex labeled graphs with m edges 
are equally likely. (We use m to count edges in this section because the number 
e = 2.71828... plays an important role in asymptotic arguments.) 


8.5.14. Definition. Model B: Given n and m = m(n), let each graph with 


vertex set [n] and m edges occur with probability (* ‘ee where N = (5). The 
random variable G” denotes a graph generated in this way. 


These two are the most common of many models studied. Model B seems 
more pertinent for applications. We ask questions like “as a function of n, how 
many edges are needed to make a graph almost surely connected?” In Model 
A we would say, “as a function of n, what edge probability is needed to make a 
graph almost surely connected?” Unfortunately, calculations needed to answer 
such questions are messier in Model B than in Model A. 

Fortunately, Model B is accurately described by Model A when n is large 
and p =m /(5); because the actual number of edges generated in Model A is 
almost always very close to the resulting expectation m. The correspondence 
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is valid for most properties of interest. The proof of this requires detailed use 
of the binomial distribution for the number of edges. A graph property Q is 
convex if G satisfies Q whenever F C G C H and F, FH satisfy Q. 


8.5.15. Theorem. (Bollobds [1985, p34-35]) If Q is convex and p(1 — p)(5) = 
oo, then almost every G? satisfies Q if and only if, for every fixed x, almost 
every G” satisfies Q, where m = | p(5) + x[p(1 — p)(5)]"” |. my 


Theorem 8.5.15 justifies restricting our attention to Model A. It also mo- 
tivates letting p be a function of n; to study graphs with a linear number of 
edges, we must let p vanish at a rate like c/n, where c is constant. Constant p 
yields dense graphs. 

Proving P(Q) — 1 is usually much easier than computing P(Q); this 
distinction is important. Exact computation of probabilities is difficult, unnec- 
essary, and avoided wherever possible. Instead we use asymptotic analysis, 
which rests on limits. We write a, — L for lim,_,..a, = L. To compare growth 
rates of sequences, we use “big O” and “little o” notation (see Appendix B for 
definitions). We write a, = b,(1 + 0(1)) when (a) and (b) differ by a sequence 
that grows more slowly than (b); equivalently, a, /b, — 1. When a,/b, — 1, we 
say that a, is asymptotic to b,, written An ~ by. 

We use asymptotic statements to discard lower-order terms that don’t af- 
fect whether lim,_,.. P(Q) = 1. Computing P(Q) first and then proving that 
the formula tends to 1 is harder and is unnecessary. We need only show that 
P(-—Q) is bounded by something tending to 0. Many asymptotic arguments are 
“sloppy” in this sense; we don’t care how loose the bound is as long as it tends 
to 0. Experience refines our intuition about what can be discarded safely. 


8.5.16. Theorem. (Gilbert [1959]) When p is constant, almost every G?’ is 
connected. 


Proof: We can make G disconnected by picking a vertex partition into two sets 
and forbidding edges between the two sets. Occurrence of edges within the sets 
is irrelevant. We bound the probability g, that G? is disconnected by summing 
P({S, S] = @) over all bipartitions S,S. Graphs with many components are 
counted many times. When |S| = k, there are k(n — k) possible edges in [S, S]. 
Each has probability 1 — p of not appearing, independently, so P([S, S] = @) = 
(1 — p)*"-, Considering all S generates each partition from each side, so 
qn < ip iC — pyr? 
This formula is symmetric in k and n —k; hence gq, is bounded by 
ba ra ({)(1 — p)*"-®. We loosen the bound to simplify it. Using (/) < n‘* 
and a — p)?* < (1— p)"”? (for k < n/2) yields g, < yn — p)"/?)*. For 
large enough n, we have n(1 — p)"/2 < 1. This makes our bound the initial 
portion of a convergent geometric series. We obtain g, < x/(1— x), where 
x = n(1— p)"/*. Since n(1 — p)"/2 > 0 when p is constant, our bound on gq, 
approaches 0 as n — oo. a 
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We avoid struggling with probability formulas by introducing integer- 
valued random variables and techniques involving expectation. If X is a non- 
negative random variable such that X = 0 when G’ has property Q, then 
E(X) — 0 implies that almost every G” satisfies Q. This is a special case of 
the following lemma. We prove it only for integer variables, but it also holds 
for continuous variables. 


8.5.17. Lemma. (Markov’s Inequality) If X takes only nonnegative values, then 
P(X >t) < E(X)/t. In particular, if X is integer-valued, then E(X) > 0 
implies P(X = 0) — 1. 


Proof: E(X) = oyso KP = Lees, KPR Zt Vas, Pe =P (X > 1). | 


For connectedness, we can define X(G”) by X == 1 if G is disconnected and 
X = Ootherwise. The expectation of an indicator variable is the probability that 
it equals 1. We proved P(X = 1) — 0(when p is constant) to prove that almost 
every G”’ is connected. With a different random variable we can simplify the 
proof and strengthen the result. We still want G to satisfy Q if X = 0 (in order 
to apply Markov’s Inequality), but we don’t need (X = 0) © (G satisfies Q). We 
use a sum X of many indicator variables, such that G satisfies Q if X = 0. The 
linearity of expectation and convenience of E(X;) = P(X; = 1) for the indicator 
variables simplify the task of proving E (X) > 0. 


50) 


1 
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8.5.18. Theorem. If p is constant, then almost every G, has diameter 2 (and 
hence is connected). 


gram 


Proof: Let X(G”) be the number of unordered vertex pairs with no common 
neighbor. If there are none, then G, is connected and has diameter 2. By 
Markov’s Inequality, we need only show E(X) — 0. We express X as the sum 
of (5) indicator variables X; ;, one for each vertex pair {v;, vj}, where X;,; = lif 
and only if v;, v; have no common neighbor. 

When X;,; = 1, the n —2 other vertices fail to have edges to both of these, so 
P(X;,; = 1) = (1— p?)""? and E(X) = (3)(1— p”)" *. When p is fixed, E(X) — 0, 
and hence almost every G, has diameter 2. a 


The intuition behind this argument, made precise by Markov’s Inequality, 
is that if we expect almost no bad pairs, then almost every graph has none. The 
summation disappears, and for the limit we need only know that (1 — p?)"~? 
tends to 0 faster than any polynomial function of n. 


THRESHOLD FUNCTIONS 


Roughly speaking, random graphs with constant edge probability are con- 
nected because they have many more edges than needed to be connected. To 
improve Theorem 8.5.18, we want to make p(n) as small as possible to have 
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almost every G” connected. We need the notion of a threshold probability func- 
tion. By the relationship between Model A and Model B, a threshold edge 
probability also yields a threshold number of edges. 


8.5.19. Definition. A monotone property is a graph property preserved by 
addition of edges. A threshold probability function for a monotone 
property Q is a function t(n) such that p(n)/t(n) — O implies that al- 
most no G? satisfies Q, and p(n)/t(n) — oo implies that almost every G’ 
satisfies Q. Threshold edge function is defined similarly for Model B. 


This is a broad notion of threshold function; it allows a property to have 
many threshold functions. A threshold function t(n) is “sharper” when the 
“almost‘surely” behavior occurs when p(n)/t(n) approaches nonzero constants. 
Still sharper is a threshold t (n) such that this behavior occurs when p(n) differs 
from t(n) by the subtraction or addition of a lower-order term. 

Markov’s Inequality does half the job of deriving a threshold function. If 
X = 0 implies property Q and we prove that E(X) — 0, then P(Q) > 1. We 
obtain candidates for threshold functions by determining which functions p(n) 
yield E(X) — 0. Often we obtain p(n) such that E(X) > 0 or E(X) > ~, 
depending on the value of a parameter c. The property E(X) — oo suggests 
that P(X = 0) — 0, but this does not always follow. For example, E(X) — oo 
when P(X = 0) = .5 and P(X =n) =.5. To obtain P(X = 0) — 0, we must 
prevent the probability from spreading out like this. 


8.5.20. Definition. The rth moment of X is the expectation of X’. The vari- 
ance of X, written Var(X), is the quantity E[(X — E(X))"]. The standard 
deviation of X is the square root of Var(X). 


8.5.21. Lemma. (Second Moment Method) If X is a random variable, then 


P(X =0)< PY In particular, P(X = 0) > 0 when aoe > 1. 


Proof: Applied to the variable (X — E(X))? and the value t?, Markov’s In- 
equality yields P[(X — E(X))? > t?] < E[(X — E(X))?]/t?..We rewrite this as 
P[|X — E(X)| = t] < Var(X)/t? (Chebyshev’s Inequality). Since 


E((X — E(X))?] = EX? — 2XE(X) + (E(X))*] = E(X”) — (E(X))’, 


Chebyshev’s Inequality becomes P[|X — E(X)| >t] < (E(X?)-E(X)”)/t?. Since 
X = 0 only when |X — E(X)| > E(X), setting t = E(X) completes the proof. 


Intuitively, if the mean grows and the standard deviation grows more 
slowly, then all the probability is pulled away from 0, and P(X = 0) > 0 
results. We illustrate the method by considering the disappearance of isolated 
vertices. Since a connected graph has no isolated vertices, a threshold for con- 
nectedness must be at least as large as a threshold for disappearance of isolated 
vertices. The computations for the latter are simpler, because we can express 
this condition using a sum of identically distributed indicator variables with 
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easily computed expectations. In fact, both properties have the same thresh- 
old, since it happens that at the threshold almost every graph consists of one 
huge component plus isolated vertices. 


8.5.22. Theorem. In Model A, Inn/n (natural logarithm) is a threshold proba- 
bility function for the disappearance of isolated vertices (that is, 5(G) > 1). 
(The corresponding threshold in Model B is $n Inn.) 


Proof: Let X be the number of isolated vertices, with X; indicating whether ver- 
tex i is isolated. Then E(X) = )- E(X;) = n(1— p)""!. We study the asymptotic 
behavior of E(X) in terms of p(n). Since 


(1 ~ p)" => et ln(1-p) = eWnP emp L1/2+p/3+--1 


our expression for E(X) simplifies asymptotically if np? + 0. This is equivalent 
to p € o(1/./n) and implies (1 — p)" ~ e~”” and (1 — p)~! ~ 1, yielding E(X) ~ 
ne~"?, To simplify further, set p = clnn/n to obtain ne~"? = n!~°, where c may 
depend on n. Constant c yields p € 0(1/,/n), as we needed earlier. When c > 1, 
we have E(X) ~ n!~ -» 0, which proves one side of the threshold. 

When c < 1, we have E(X) — oo and use the second moment method. We 
need only show that E(X?) ~ E(X)?.2This uses another helpful property of 
indicator variables: X? = X;. Thus, 


; 1_ books 


E(X?) = yyy E(XP) + Dig; E(X;X)) = E(X) + n(n — 1)E(X;X)). 


The indicator variable X;X; has value 1 only when v, and v; are both isolated, 
which forbids 2(n — 2) +1 sdpee: Thus E (X;X;) = (1— p)*"-*. Again (1 — py ~ 
e~"?, so E(X;X;) ~ e~?"”, and 


E(X?) ~ E(X) + n(n — 1)e72"? ~ E(X) + E(X)?. 
Since E(X) — 60, this yields E(X?) ~ E(X)?. a 


Theorem 8.5.22 is stronger than required by the definition of threshold 
function. The threshold is sharper: we guarantee or forbid isolated vertices 
when the ratio of p(n) to Inn/n approaches a nonzero constant, not 0 or oo. 

In fact, yet sharper information is known about the threshold for isolated 
vertices. When p = lgn/n+x/nand X counts the isolated vertices, P(X =k) ~ 
e~“ u*/k!, where u = e~*. (Readers may recognize this limiting distribution as 
the Poisson distribution.) For k = 0, we have P(X = 0) ~ e~”. Thus this 
additive term in p describes the movement through the threshold from almost 
always isolated vertices to almost never isolated vertices. Many such sharp 
thresholds are known, but the techniques for deriving the asymptotic Poisson 
distribution are beyond our scope here. 

Next we derive a threshold function for the appearance of fixed subgraphs. 
A graph is balanced if the average vertex degree in every induced subgraph is 
no larger than the average degree of the entire graph. All regular graphs and 
all forests are balanced. 
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8.5.23. Theorem. If H is a balanced graph with k vertices and / edges, then 
p =n’! is a threshold function in Model A for the appearance of H asa 
subgraph of almost every G”. 


Proof: Let X be the number of copies of H in G”; X is the sum of indicator 
variables for the possible copies of H in K,. There are n(n —1) --(n-—k +1) 
ways to map V(/7) into [n]. Each copy of H aries A canes where A is the 
number of automorphisms of H. We thus have + 1 Ty! = on — i) variables X;. 


Since a copy of H occurs when its edges occur, P(X, = 1) = p’. Because k is 
fixed, E(X) ~ nk p'/A. 

Setting p(n) =c,n-*/' yields E(X) ~ c!,/A. Hence c, — 0 yields E(X) — 0, 
and «,, > o yields E(X) — oo. It remains only to obtain E(X?) ~ E(X)* when 
C, > oo. Again E(X*) = E(X)+ isi E(X;,X;). The summands are not equal; 
E(X;X;) depends on H' = H;™ H;. We group the terms by the choice of H’ C H. 
When H’ has r vertices and s edges, the number of edges needed to create H; 
and H; is 2/ — s, so E(X;X;) = p?. 


To specify pairs j. j such that H’ = H; M H;, we choose r vertices for H’, 
k —r vertices for each of H; — H' and fe H', and an extension of H’ to each of 
those sets. The number of ways to choose the vertex sets is OES 
which is asymptotic to n~"/[r!(k — ry!?]. The number M of ways to extend 
H’ to obtain copies of H in both specified k-sets depends only on H and H’; 
it is independent of n and p. Let a; be the constant M/[r!(k —r)!?]. The 
contribution to )* E(X;X;) from pairs /, j such that H; MH; = H' is asymptotic 
to ay n2—" p2!-s: we call this En. 

When r = s = 0, we have M = (k!/A)*. Hence ay: ~ n™ p%/A? ~ E(X)? 
when H’ is the “null graph”. This is the total contribution to > E(X;X;) for 
all i, j with H;, H; disjoint and is asymptotic to E(X)*. The proof is completed 
by showing that the total enna aa from all other choices of H’ has lower 
order. We have Ey ~ ay: A7E(X)?n~“ p™. Since 2s/r is the average degree of 
H', the hypothesis that H is ae yields 2r/s > 2k/1, or pn'’ > pri! > 
ore) ‘when C, — oo. Since pn'/’ — oo is equivalent to nn" p~* — 0, we obtain 
Ey € o(E(X)*) for H’ #4 @. Since the number of possible subgraphs H’ is 
bounded (by an expression involving the constants k and /), this implies that 
E(X?) ~ E(X) + Eg ~ E(X)?. i 


This result generalizes for all H. The ratio d(H) = e(H)/n(A) is the den- 
sity of H, and p(H) = max;rcyd(F) is the maximum density. These are 
equal precisely when H is balanced, and then p = n~/?“#) is the threshold 
for appearance of H. Every graph H has a balanced subgraph F such that 
d(F) = p(H). When pn?” — 0, almost every G’ has no copy of F; hence it 
also has no copy of H. In fact, p = n~1/?“*) is always a threshold function for 
the appearance of H (Exercise 25). 
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EVOLUTION AND GRAPH PARAMETERS 


In the subtitle to his book, Palmer [1985] tells us that random graphs 
involve the study of 


“THRESHOLD FUNCTIONS, which facilitate the careful study of the 
structure of a graph as it grows, and specifically reveal the mysterious 
circumstances surrounding the abrupt appearance of the UNIQUE 
GIANT COMPONENT, which systematically absorbs its neighbors, 
devouring the larger first and ruthlessly continuing until the last ISO- 
LATED VERTICES have been sucked up, whereupon the Giant is 
suddenly brought under control by a SPANNING CYCLE.” 


The evolutionary viewpoint generates random graphs with m edges in a way 
that yields the same probability space as Model B but makes intuitive reason- 
ing easier. Almost everything suggested about random graphs by intuition or 
experimentation is true. The evolutionary viewpoint develops this intuition. 

Generating m edges simultaneously or one-by-one yields the same proba- 
bility distribution, making the graphs with m edges equally likely. By studying 
the likely effect of a new edge on the present structure, we can make intuitive 
hypotheses about the properties of the graph at any stage. A stage of evolution 
is a range of values for m(n) (or p(n)) in which the structural description of the 
typical graph doesn’t change much. We have studied the basic techniques for 
verifying these descriptions, but the computations can be difficult. Hence we 
will only describe the stages using the evolutionary intuition. 

We remark first that a constant multiple of almost nothing is almost noth- 
ing. Therefore, when each of A;,..., A; happens almost always (r is fixed), it 
follows that almost always they all happen. 


Beginning with many vertices and no edges, each new edge is likely to 
be isolated. The random graph is a matching until a substantial fraction of 
the vertices are involved in edges. The thresholds p ~ cn~*/*-) for appear- 
ance of fixed subtrees generalize this. Let 4.(n) = n~/«-). If p/t, > oo but 
P/tr41 — 0, then every fixed subtree on k vertices appears, but none on k + 1 
vertices appears. (The statements about individual trees become statements 
about all trees of that order.) Furthermore, this p is also below the threshold 
for appearance of fixed cycles (density 1, length bounded by k), so G? is a forest 
of trees of order at most k, and every tree on k vertices appears as a component. 

Intuitively, the random graph has no cycles in this stage of evolution be- 
cause when there is no large component a random added edge is much more 
likely to join two components than to lie in one component. To make the intu- 
ition precise, we let X be the number of cycles in G? and compute 


E(X) = Yjg (M)AK — Dip! < jg (np)*/2k. 
If pn —> 0, then E(X) —> 0. 


The next major stage of evolution is p = c/n with O < c < 1. With X 
counting cycles, we can no longer say that E(X) ~ )";_5 (np) /2k, because when 
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k is a substantial fraction of n the ratio n‘/(n), does not approach 1. We must 
break E(X) into two sums, and the arguments become more difficult. When 
pn — c, we find that E(X) approaches a constant c’, and the number of cycles 
in G” is asymptotically Poisson distributed. With cycles in a few components 
and all components small, we still expect the next edge to join two components 
or create a cycle in a component that doesn’t have one. In this range, the size of 
the largest component is about logn, there are many components, each having 
at most one cycle. Most vertices still belong to acyclic components. 

When c reaches and passes 1, the structure of G” changes radically. This 
is called the double jump because the structure of G” is significantly different 
forc <1,c ~1,andc > 1. At pn = 1, the second moment method guarantees 
that almost every G” has a cycle. Also, the order of the largest component 
jumps from logn to n?/3,. For pn = c > 1, the number of vertices outside the 
“siant component” becomes o(7). Also G” is likely to have some cycle with three 
crossing chords and be nonplanar. 

Next, let p approach clnn/n. With c < 1, we have proved that almost ev- 
ery G” has isolated vertices. With c > 1, these disappear. As we add edges toa 
disconnected graph, the edges may go within a component or connect two com- 
ponents. When the components are all small, added edges will almost surely 
join components. Eventually, this results in the creation of a giant component. 
At this point, added edges are likely to lie within the giant component or to join 
it to one of the small components. Of the small components, those most likely 
to receive such edges are the larger ones. In other words, as c passes through 
1 the last remaining small components swallowed by the giant component are 
isolated vertices. This explains intuitively why the threshold for connectedness 
is the same as the threshold for the disappearance of isolated vertices. With 
c > 1, suddenly almost every G” also has a spanning cycle. Minimum degree k 
(and the appearance of the Hamiltonian cycle when k = 2) has a threshold that 
involves a lower-order term: Inn/n + (k — 1) InInn/n. 

The last stages of evolution are those where pn/Inn — oo but p = o(1), 
and then finally p = c; this brings us back to where we began our study. 


When p = clogn/n with c —> ov, we leave the domain of sparse graphs. 
The evolutionary viewpoint becomes less valuable, and we study properties of 
the random graph. We pay less attention to probability threshold functions 
and concentrate on the likely value of graph parameters, especially when p 
is constant. Given a parameter ju, we want to show that u(G?) ~ f(n) for 
almost every G”?. We can view this as a threshold when 1. (G”) is almost always 
between (1 — €) f(n) and (1+ €)f(n), for each e > 0. If u(G”) is almost always 
between f(n) — €g(n) and f(n) + €g(n), where g(n) = o(f(n)), then we have a 
stronger statement, written as u(G”) € f(n)(1+0(1)). 

Some properties that are true for almost all graphs occur in no known 
examples! For the known lower bound on Ramsey numbers, there is still no 
construction of an infinite class of graphs such that a(G) < log ya(n(G)) and 
w(G) < log (n(G)), even though almost all graphs have this property. 

Properties of the random graph can lead to a fast algorithm that solves a 


438 Chapter 8: Additional Topics 


difficult problem on almost all inputs. For example, after stating two results 
about vertex degrees in random graphs, we show how to use properties of the 
degree sequence to design a fast algorithm to test isomorphism “almost always”. 
In the literature of random graphs, w, denotes a function that is unbounded 
but grows arbitrarily slowly. 


8.5.24. Theorem. (Erdés—Rényi [1966]) If p = w, logn/n and « > 0 is fixed, 
then almost every G” satisfies 


(1 —e)pn < 8(G”) < A(G”) < (1+ .€)pn. a 


Most vertices have degree near the average, but there is still considerable 
variation. Bollobas [1982] showed that for p < 1/2, the vertex of maximum 
degree is unique in almost every G” if and only if pn/logn — oo. When we 
complete evolution by returning to the realm of constant edge probability, more 
detailed results are known about the degree distribution. There will almost 
always be some vertices with isolated high degrees before the degrees begin to 
bunch up. Bollobas determined how many distinct degrees can be guaranteed. 


8.5.25. Theorem. (Bollobdés [1981b]) In Model A with p fixed and t é€ 
o(n/logn)'/4, almost every G’ has different degrees for its t vertices of 
highest degree. If t ¢ o(n/ log ny, 4 then almost every G” has d; = d;41 for 
somei <f. a 


n.me/math 


We apply this result to isomorphism testing. No polynomial-time algo- 
rithm is known for this problem, but. Babai—Erdés—Selkow [1980] used the 
degree results for the random graph to develop a fast algorithm that almost al- 
ways works. We define a set H that contains almost all graphs and show that 
isomorphism with a graph in H can be tested quickly. 

The testing is done by a canonical labeling algorithm, which accepts and 
labels a graph in a canonical way if it belongs to H. The desired property is that 
when vertices are labeled as v;,..., v, in one graph and wj,..., w, in another, 
only the bijection mapping v; to w; is a possible isomorphism. Isomorphism can 
then be tested by comparing the adjacency matrices under this labeling. 


8.5.26. Corollary. (Babai—Erdés—Selkow [1980]) There is a quadratic algo- 
rithm that tests isomorphism for almost all pairs of graphs. 


Proof: Given a graph G on n vertices, presented by its adjacency matrix, com- 
pute and sort the vertex degrees, labeling the vertices in decreasing order of 
degree. Fixr = [3lgn]. If d(v;) = d(v;,3) for anyi <r, reject G. Using p = 1/2 
in Theorem 8.5.25 implies that almost every graph successfully passes this test. 

Let U = {v1,...,v,}. With r = |3lgn], there are about n? distinct subsets 
of the vertices of U. Since only n —r vertices remain outside U, there is a chance 
that they can be distinguished by their neighborhoods in U. The set H will be 
all the graphs reaching this stage for which this holds: the vertices of V — U 
have distinct neighborhoods in U. To test this in O(n?) time and complete the 
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labeling, for each x € V—U encode N(x) NU asa binary r-tuple. Evaluate these 
as binary integers, and sort them! These steps take O(n logn) time. Relabel 
the vertices vu,1 to v, AS W,41,..., W, in decreasing order of these values. If two 
consecutive values are the same, reject G. 

If G has passed this far, then G has no nontrivial automorphisms. A graph 
isomorphic to G has only one isomorphism to G, given by applying the canonical 
labeling algorithm to it. The last stage, if both graphs pass canonical labeling, 
is to.compare the adjacency matrices with rows and columns indexed by the 
canonical labeling. The graphs are isomorphic if and only if the matrices are 
now identical. This comparison takes O(n”) time. 

We must show that for almost every G’, the adjacency vectors within a 
specified set of r vertices are distinct for the remaining vertices. If p < 1/2, 
then the probability for any pair x, y that x, y have the same adjacencies in U 
is bounded approximately by (1 — p)’. We say approximately because U is not 
chosen at random; choosing U as the set of vertices of highest degree impairs 
randomness, increasing the probability of a specified edge incident to these 
vertices. Nevertheless, it doesn’t change by much, and the expected number 
of pairs of vertices outside U with identical adjacencies in U is bounded by 
O(("5")(1 — p)’). Given our choice of r, we can bound the base 2 logarithm of 
this by 2lgn — 3lgblgn, where b = 1/(1 — p) > 2 (if p < 1/2). This tends to 
—oo, so almost all graphs have distinct: adjacency vectors in this set. a 

The probability of rejection in this labeling algorithm is bounded by n~1/" 
for sufficiently large n. Later improvements led to an algorithm running in 
time O(n”) with rejection probability c=” (Babai-Kuéera [1979]). 


~ 


CONNECTIVITY, CLIQUES, AND COLORING 


Studying the “typical behavior” of a random structure often involves study- 
ing probability distributions of its parameters. Here we consider connectivity, 
cliques, and colorings for random graphs. 

For random graphs, naive algorithms may become good. For example, find- 
ing a maximum clique is NP-hard. If we know that almost every graph has 
clique number about 2lgn, then we can test all vertex subsets up to size 3lgn 
for being cliques. If w(G) < 3lgn, then this computes w(G), since every set of 
size w(G) + 1 is not a clique. If #(G) > 3lgn, then the algorithm fails to com- 
pute w(G), but this rarely happens. There are too many subsets of size 2lgn 
for this to be a polynomial-time algorithm, but it’s close, and it illustrates one 
way in which the properties of random graphs can be used algorithmically. 

Some NP-hard problems are trivial for random graphs. Although A(G) < 
x'(G) < A(G) + 1 for every simple graph G (Vizing [1964]), deciding between 
these values is NP-hard (Holyer [1981]). Vizing proved that x'(G) = A(G)+1 
only when G has at least 3 vertices of maximum degree. Thus Erdés and Wilson 
[1977], who noted the uniqueness of the vertex of maximum degree when p = 
1/2, also observed that x’(G) = A(G) for the random graph. 
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For sparse graphs and constant k, the thresholds for connectivity k and 
minimum degree k are the same. Does this also hold for constant edge prob- 
ability? Theorem 8.5.18 can be generalized and strengthened to show that if 
k € o(n/logn) and p is fixed, then almost every G? has k common neighbors 
for every vertex pair and hence is k-connected (Exercise 33). Improving this 
requires other methods; Bollobas [1981b] showed for constant p that almost 
every G?’ has connectivity equal to minimum degree. 

What about clique number? For fixed k, Theorem 8.5.23 yields a probability 
threshold for the appearance of a k-clique, but for constant p the clique number 
grows with n. Determining the clique number is NP-complete, but for a random 
graph we can guess the correct value with high probability without looking at 
the graph! Amazingly, for fixed p almost every G’ has one of two possible 
values for the clique number (as a function of n), and for each k € N there is a 
range of n where the clique number almost always equals k. The approach is to 
find bounds on r(n) such that almost every G? has an r-clique and almost none 
has an r + 1-clique. 


8.5.27. Theorem. (Matula [1972]) For fixed p = 1/b and fixed « > 0, al- 
most every G? has clique number between |d — «| and |d+«], whered = 
2 log, n — 2 log, log, n + 1+ 2 log, (e/2). 

Proof: (sketch) If X, is the number of r-cliques, then E(X,) = (") p®). Since r! ~ 

(r/e)’/2xr (Stirling’s approximation), also E(X,) ~ (2ar)~¥/2(enr~1 p&-/?)’, 

Ifr > oo and (enr~!p—/?) < 1, then we expect that E(X,) > 0. To determine 

r(n) such that this holds, take logarithms (base b) in the inequality and solve 

for r to find 


pS Tice — Blo P14 ee. 


This is approximately equivalent to r > d(n) as defined above. More precisely, 
ifr > d+ e, then almost every G’ has no clique of size r. 

The lower bound comes from careful application of the second moment 
method, as in Theorem 8.5.23, but the dependence of r on n makes the analysis 
more difficult. The expectation of X2 sums the probability of common occur- 
rence for all ordered pairs of r-cliques. This probability depends only on the 
number of common vertices, so 


2 _ (™\ 57 (7)\("-7) 20-@ 
a (EGC) : 


We want to show that the term for k = 0 (disjoint ehques) dominates. Let 
E(X®)/E(X,)® = a, + Bn, where a, = (")” (">") and B, = (")> yy (1) ("1 b®). 
We seek a, ~ 1 and f, — 0. When, ~ 2log,n,.an asymptotic formula for 
(7)/(2) leads to a, ~ e/-r) _5 1, The discussion of 8, is more difficult; see 


k 
Palmer [1985, p75-80]. a 


Our study of graph parameters can be applied to measure the strength of 
conditions for Hamiltonian cycles (Palmer [1985, p81-85]). A theorem proves 
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nothing if its hypotheses are never satisfied; this suggests saying that such a 
theorem has strength 0. A theorem is strong if the conclusion is satisfied only 
when the hypothesis is satisfied; then the hypotheses cannot be weakened. 
Define the strength of a theorem to be the probability that its hypotheses are 
satisfied divided by the probability that its conclusion is satisfied. 

Consider sufficient conditions for Hamiltonian cycles. Since p = logn/n is 
a threshold for a Hamiltonian cycle, almost every G’ is Hamiltonian when p is 
fixed. Dirac [1952b] showed that G is Hamiltonian when every vertex degree 
is at least n/2 (Theorem 7.2.8). When p > 1/2, this condition holds for almost 
every G”; when p < 1/2, it almost never holds. Hence the asymptotic strength 
of Dirac’s Theorem is 0 when p is a constant at most 1/2. The same fate befalls 
the other degree conditions of Section 7.2. 

Meanwhile, Chvatal and Erdés [1972] proved that G is Hamiltonian when- 
ever its connectivity exceeds its independence number (Theorem 7.2.19). Our 
thresholds for these parameters imply that this result is strong for every con- 
stant p > 0. We know that a(G’) < 2(1+€) log, n almost always, and we know 
that «(G?) > k almost always (when k = o(n/logn)). Hence «x > @ for almost 
every G’, and the asymptotic strength of the theorem is 1. 


Finally, we consider chromatic number for constant p. Since 1 — p is also 
constant, we can apply the results on clique number: Almost every G”? has no 
stable set with more than (1+ 0(1))2 log, n vertices, where b = 1/(1— p). Hence 
x(G’) > (1/2 + 0(1))n/log,n almost always. Achieving this bound requires 
finding many disjoint stable sets with near-maximum sizes. For a decade, the 
best result was an algorithmic guarantee of a coloring with at most twice the 
number of colors in the lower bound. = 

Bollobas [1988] proved that the lower bound is achievable, by using an- 
other probabilistic technique that guarantees finding enough large stable sets. 
He proved that, in almost every G”, every set having at least n/(log, n)” vertices 
contains a clique of order at least 2 log, n — 5log, log, n. This allows stable sets 
of near-maximum size to be extracted until too few vertices remain to cause 
trouble; the remainder can be given distinct colors. 

Before developing Bollobas’ approach, we present the earlier result for its 
algorithmic interest; the greedy algorithm uses at most (1+ €)n/ log, n colors on 
almost every G’. Thus it “almost always works” as an approximation algorithm 
in the same sense that our earlier isomorphism algorithm almost always works. 
Garey and Johnson [1976] showed there is no fast algorithm that uses at most 
twice the optimum number of colors on every graph unless P = NP. Bollobas’ 
proof does not yield a fast algorithm for coloring almost every graph with an 
asymptotically optimal number of colors; it is an existence proof only. 


8.5.28. Theorem. (Grimmett—McDiarmid [1975]) Given edge probability p, let 
b = 1/(1-— p). For constant p and constant « > 0, almost every G? satisfies 


(1/2 — €)n/log,n < x(G?) < (1+ €)n/ log, n. 


Proof: The lower bound follows using stable sets as suggested above. For the 
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upper bound, we show that the greedy coloring of v),...,v, in order uses at 
most f(n) = (1+ €)n/log,n colors on almost every G” (for simplicity, choose € 
so that f(n) is an integer). Within the set of n-vertex graphs using more colors, 
let B,, be the set such that v,, is the first vertex to use color f, + 1. We prove 
that 5°) _, P(B,) > 0asn > oo. 

Given G, let G,, = Gl{v1,..., Un—1}]. Before color f,, + 1 is used, color f, 
must be used, so for each G € B,, the greedy coloring of G,, uses f, colors. Let 
k; be the number of times color i appears in this coloring. To require use of color 
fn +1, Um+1 must have at least one neighbor of each color 1,...; f,. Given the 
numbers {k;}, the probability of this is iad [1-—(1— p)*]. 

Bollobas and Erdés [1976] simplified the subsequent computations involv- 
ing this bound by observing that the bound is maximized when the k;’s are all 
equal (Exercise 8.3.37). Thus 


Wu -a- py) <-— pf <[1- = py. 


Given G,,, we have b,, = [1 —(1— p)"f™]/™ as a bound on the probability that 
the full graph G belongs to B,,. Since this holds for each G,,, we conclude that 
P(Bn) < b,. This holds for all m, so 5~” _, P(Bm) < nby. 

Using (1 — p)~* < e~, we obtain nb, < ne~f%-?)"" Substituting f, = 
cn/ log, n yields (1 — p)"/f =n~-“°. Thelogarithm of the bound becomes logn — 
cn1-/e/log, n. This tends to —oo for c'> 1, so the probability that the greedy 
algorithm uses more than f(n) colors is bounded by a function tending to 0. 


e/n 


The order of growth of x(G) sheds light on other famous problems in graph 
theory. Hajés conjectured that every r-chromatic graph contains a subdivi- 
sion of K, (see Remark 5.2.21). This was disproved by Catlin [1979] (Exercise 
5.2.40). Erdés and Fajtlowicz [1981] observed that the chromatic number of G? 
almost always grows like ©(n/logn)). On the other hand, the largest r such 
that G? contains a subdivision of K, grows like @(./n)). Thus the chromatic 
number is almost always much larger, and Hajés’ Conjecture is almost always 
very false. 

In contrast, almost every G? has a subgraph contractible to K, when r € 
©(n/./logn). Thus almost every graph satisfies the weaker conjecture of Had- 
wiger (Remark 5.2.21), which states that every r-chromatic graph has a sub- 
graph contractible to K,. 


MARTINGALES 


Advanced techniques in probability lead to elegant results on combinato- 
rial structures without the drudgery involved in second moment and higher 
moment computations. The theory aims to develop paradigms that can be ap- 
plied without repeating computational details. 

Some of these methods employ lists of related random variables. The 
resulting stochastic process displays more consistent and predictable global 
behavior than the individual random variables do. 
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In the classical random walk on a line, at each step there is probability p 
of moving one unit to the left, probability p of moving one unit to the right, 
and probability 1 — 2p of not moving. No matter what the earlier history of 
the walk has been, the expected position after t steps equals the actual position 
after t — 1 steps. This is the defining property of a martingale. 


8.5.29. Definition. A martingale is a list of random variables Xo, ..., X, such 
that the expectation of X;, given the values of Xo,..., X;_1, equals X;_1. 


The expected position of the random walk after n steps is at the origin. 
Less obvious is that the walk is highly unlikely to be very far from the origin, 
as a function of n. We shall see that this follows from its inability to move more 
than one unit in each step. 

Martingales can make it easy to show that a random variable is highly con- 
centrated around its expected value. When the technique applies, it makes the 
detailed computation in the Second Moment Method unnecessary. The hard 
work is accomplished by Azuma’s Inequality, also called the Martingale Tail 
Inequality. This inequality states that if successive random variables in a mar- 
tingale always differ by at most 1, then the probability that X, — Xo exceeds 
./n is bounded by e~*’/?. We first prove two lemmas. These statements hold 
for continuous random variables, but again we consider only discrete variables. 


/m 


8.5.30. Lemma. Let Y be a random variable such that E(Y) = 0 and |Y| < 1. 
If f is a convex function on [—1, 1, then E(f(Y)) < s(f(-1) + f(1)]. In 
particular, E(e'’) < 5(e! +e) forall t > 0. 


Proof: When Y takes only the values +1, each with probability .5, we have 
E(f(Y)) = 5 [f(-—1)+ f())]. For other distributions, pushing probability “out to 
the edges” increases E(f(Y)). For discrete variables, we can use induction on 
the number of values with nonzero probability. Convexity implies that f(a) < 
+4 f(-1) + a+? F(1). If P(Y = a) =a, then we can decrease the probability at 
a to 0, increase P(Y = —1) by a+;4 and increase P(Y = 1) by a att to obtain 
a new variable Y’ with the same expectation. By the convexity inequality and 
the induction hypothesis, E(f(Y)) < E(f(Y’)) < sf (-1) + fl. a 


—lia 1 


8.5.31. Definition. For events A and B, the conditional probability of A 
given B is obtained by treating the event B as the full probability space, 
which means normalizing by P(B). Thus we define P(A|B) = Pans 2) 
When Y, X are random variables, we write Y|X for “Y given X”. This 

defines a random variables for each value of X; we treat X as a constant i 


and normalize the resulting distribution for Y by P(X =i). 


444 Chapter 8: Additional Topics 


For Azuma’s Inequality, we use expectation of conditional variables. For 
each i, we compute the expected value of Y when restricted to the sample points 
where X = i. The expectation E(E(Y|X)) is the expectation of E(Y|X = /) 
over the choices for i, which occur with probability P(X =i). The result is an 
expectation over the entire sample space. It removes the effect of conditioning, 
and we obtain E(E(Y|X)) = E(Y). 


8.5.32. Lemma. E(E(Y|X)) = E(Y). 


Proof: Let p;,; = P(X =i and Y = j). Since E(Y|X =i) = iM, 


E(E(Y|X)) = )) E(V|X =i)P(X =i) = D0) pis = EY). a 
i t J 


8.5.33. Theorem. (Azuma’s Inequality) If Xo, 36 Xp is a martingale with 
|X; — X;-1| < 1, then P(X, — Xo > AJn) < e*”. 


Proof: By translation, we may assume that Xo = 0. For t > 0, we have X, > 
r./n if and only if e'*" > eV", and hence P(X, > AJn) = P(e'*" > ev"), 
Applied to e'*", Markov’s Inequality yields P(e'*" > e?") < E(e'*")/e"", This 
bound holds for each t > 0, and later we will choose ¢ to minimize the bound. 
First we prove by induction on n that E(el*) < 5 (e! +e‘). We introduce 


X,-1 to condition on it. Lemma 8.5.32 yields 
E(e'*") aie E(elXr-t el (Xn-Xn-1)) = E(E(e!*r-t et Xn-D 1X4), 


When we condition on X,_1, the value of X,-1 18 constant for the inner ex- 
pectation. Hence we can remove e'*"-! from the inner expectation to obtain 
E(e'*") = E(e'*"" E(e'” |X,_1)), where Y = X, — X,-1. Because {X,} is a mar- 
tingale, E(Y) = 0, and by hypothesis |Y| < 1. Hence Lemma 8.5.30 applies, 
yielding E(e'’|X,_-1) < $(e' +e7'). This itself is now a constant, yielding E(e'*") 
= 5 (e! + e')E(e'*1), The induction hypothesis completes the proof. 

We weaken the bound to a more useful form by observing that 5(e’ + e~') < 
e’/?, This holds because the left side is )>1?*/(2k)! and the right side is 
1 /(2kk!). Hence our original probability is bounded by e””/2-"" for each 
t > 0. We obtain the best bound by minimizing over t. The exponent is 
quadratic; we minimize it by choosing t to solve tn —A,/n = 0, ort = A/./n. The 
resulting bound is e~*’/?, a 


Azuma’s Inequality is one-sided; it bounds the probability that X, is much 
larger than Xo. Since the conditions are symmetric in sign, applying the in- 
equality to {—X;} yields the same inequality for the other tail, in which X,, is 
much smaller than Xo. 


8.5.34. Example. The pragmatic gambler. A gambler can bet up to n times, 
where n is fixed. Each time he bets, he wins or loses 1 with equal probability. 
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His goal is winning 4./n, so he stops if he reaches that value. Letting X; be 
his winnings after i games, we have X; = X;-_1 if X;_1 > 4./n, and otherwise 
X; = X;-,1+1, each with probability .5. Hence {X;} is a martingale that changes 
by at most 1 at each step, and Azuma’s Inequality applies. The probability that 
the gambler will earn 4./n is bounded by e/2. Tf, = 1, then there may be a 
reasonable chance of success, but if 1 = 10, then there is little hope. a 


In combinatorial applications, we consider a special type of martingale. We 
have an underlying probability space, and Xo is the expectation of a random 
variable X. The variable X,, is the value of X at one sample point. We define a 
martingale Xo,..., X, that describes a gradual process of learning more about 
the final value X, = X. 


8.5.35. Lemma. Let X be a random variable defined on a probability space. 
Let Fy D> F, D--- D F, be a chain-of subsets of the space, where Fo is the 
full space, F, is a single outcome; and F; is a random variable that is a 
block in a partition of F;_;. The probability of choosing F; within F;-_, is 
proportional to its probability in the underlying space. If X; = E(X|F;), 
then the list Xo,..., X,, is a martingale. 


Proof: We must prove that E(X;|Xo,..., X;-1) = X;-1. In a particular in- 
stance of the process, the list of values is the outcome of a particular sequence 
of restrictions. Each sequence of restrictions that generates the given values 
Xo,..., X;-) reaches some F;_, such that E(X|F;_;) has the given value of X;_1. 
For every such F;_;, we can take the expectation of X; over the possible values 
of F;. In each case, we obtain X;_;, so the desired formula holds regardless of 
which F;_, generated the list Xo,..., X;_1. 

We thus condition on a fixed choice of F;_; to compute E(X;|Xo,..., Xj—-1). 
Within F;_;, Lemma 8.5.32 yields E(X;) = E(E(X|F;)) = E(X). This is the 
expectation within the event F;_; (treated as a probability space), so all of 
these expresssions are conditioncd on F;_;, and the final expression is actually 
E(X|Fj-1) = Xi-1. a 


Such martingales, which we call restriction martingales, arise when we 
gradually discover a randomly generated object. Here F; is the subset of the 
probability space where the object is confined after i steps (F for “inForma- 
tion”). In coin-flipping, the sample points are list of length n, and F; may be the 
knowledge of the first i values. In random graphs, F; may be the subgraph in- 
duced by the vertices {v;,..., v;}, or F; may be the knowledge of which among 
the first i edges are present. 
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To apply Azuma’s Inequality, we need to bound |X; — X;-1|. The knowledge 
of which edges arise incident to a fixed vertex v; can change the chromatic 
number by at most 1, since x (G — v;) equals x(G) or x(G) —1. From this we can 
conclude that |X; — X;-1| < 1 in the restriction martingale defined by revealing 
vertices one by one. 


8.5.36. Lemma. Consider a random structure specified by independent steps 
S1,...; S,. Let F; be the knowledge of S;,..., 5;, and let Xo,..., X, be the 
corresponding restriction martingale for a random variable X. Let A be 
the knowledge of S; for all j 4 i, with 5S; unknown. If for each such A the 
values of X on points in A differ by at most 1, then |X; — X;_1| < 1 for alli 
(and hence P(X — E(X)) = A./n) < e*’/2), 

Proof: Consider a particular instance of F;_1, with X;_1 = E(X|F;-1) given. 

We arrange the points of F;_; in the cells of a grid. For all these points, the 

outcomes of S;,..., S;_1 are the same. Each row is a choice for F;: a block in the 

partition of F;_;. Each column is an A in which S$;4;,..., 5, are fixed and only 

S; varies. By hypothesis, in each column the maximum and minimum values 

of X differ by at most 1. Let m,, M, be the minimum and maximum of X in 

column s. 


SY) 
~ 


Choices of A (S41, 


.., 5S, fixed within column) 


efoto) 


Because 5; and S;,1,..., 5, are specified independently, the probability of 
the outcome in row r and column s is qg,p;, where gq, is the probability that §; 
yields this row and p, is the probability that S;,,,..., 5, yields this column. 
The computation of X; is the expectation across a single row: 


Di msps < E(XIFi) < DoMsps < 1+) mp. 


Since these upper and lower bounds are independent of the row index, taking 
the expectation over the entire grid to compute X;_; yields the same inequali- 
ties. Hence X;_; and X; are confined to a single interval of length 1 and differ 
by at most 1. Therefore, Azuma’s Inequality applies. a 


When the conditions of Lemma 8.5.36 hold, we conclude immediately that 
the value of X is highly concentrated around its mean. 


8.5.37. Example. Chromatic number of random graphs. Fix n, and consider 
Model A with edge probability p. Suppose we reveal the random n-vertex graph 
one vertex at a time. At stage i, we learn the edges from v; to the previous 
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vertices; this is S;, and Model A specifies the outcomes of the S;’s independently. 
The event A in which all but S; are specified is the subgraph G — v; of the 
random graph G plus the knowledge of edges from v; to later vertices. Since 
x(G — v;) < x(G) < x(G — v;) +1, the value of X differs by at most one over all 
possibilities in A. The hypotheses of Lemma 8.5.36 hold. Using both tails, we 
conclude that 


P(|\x(G) — E(x(G))|) = avn) < oe, / 


The result of Example 8.5.37 says nothing about the value of E(x(G)). 
To approximate this we again use Azuma’s Inequality. With constant edge 
probability p, we know that the clique number of G” is almost always within 1 
of d = 2 log, n — 2 log, log, n + 1+ 2log,(e/2), where b = 1/p. The same result 
holds for stable sets using the base c = 1/(1 — p) for the logarithm. To show 
that the chromatic number of G’ is close to n/(2 log. n), Bollobas showed that it 
is possible to extract stable sets of almost the maximum size until the number 
of vertices remaining is too small to matter. 


8.5.38. Theorem. (Bollobds [1988]) For almost every G’ with constant p = 
1 — 1/c, every induced subgraph of order at least m = [n/ log” n| has a 
stable set of size at least r = 2log.n — 5 log, log. n. 


Proof: (sketch) We use r-staset, by analogy with r-clique, to mean a stable set 
of size r. Let § be a set of m vertices. ‘We bound the probability that S has no 

r-staset by e~¢”"' for some d, ¢. This in turn bounds the probability that there 
exists an m-set with no r-staset by (")e74""* < 2"e-4"""| Since n = m!+°, this 
bound goes to 0, and the first moment method implies that almost every GP has 
no bad m-set. 

It suffices to study the subgraph G induced by [m]. Let X be the maximum 
number of pairwise pair-disjoint r-stasets in this subgraph, where pair-disjoint 
means they share at most one vertex. We will show that X > 1 almost always. 
To do this, it suffices to show that (1) X is highly concentrated around its mean, 
and (2) E(X) is bigger than something large (and growing). 

We invoke Azuma’s Inequality for (1). Consider the restriction martingale 
for X that results from revealing G one edge-slot at a time. At each step, we 
learn whether one additional pair of vertices induces an edge. We have Xp = 
E(X) and X (my = X. The status of one edge slot changes the value of X by at 


most 1, so Lemma 8.5.36 applies, and P(X — E(X)) < —A(9) M2) < e-¥/2. With 
A= E(x)/(")””, we have 


P(X =0) = P(X — E(X) < —E(X)) < e EO" mm) 


Hence it suffices to show that E(X)/m — oo. 

To prove this, we consider another random variable X, the number of 
r-stasets in G that have no pair in common with any other r-staset. Such 
a collection forms a pairwise pair-disjoint collection of r-stasets, so X > Xx. 
We introduced X because the restriction martingale for X does not satisfy 
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| Co on 1| < 1. In the drawing of G in the figure below, for example, we have 


r = 4 and seek 4-cliques; if the last (dotted) edge is present in G (absent in G), 
then X = 0, but if it is absent from G (present in G), then X = 3. 


NAN 
VW 


It is easier to compute E (X) than E(X). Expressing X as the sum of (”) 
indicator variables, we obtain E (X )as hig ) times the:probability that [r] induces 


an r-staset that is pair-disjoint from all others. This is (1 — p)) times the 
conditional probability that [r] does not conflict with other r-stasets, given the 
event Z that [r] is in fact independent. Let Y be the number of other r-stasets 
overlapping [r] in at least two elements. By Markov’s Inequality, E(Y|Z) > 0 
implies P(Y = 0|Z) — 1. Since each set counted shares at least two vertices 
with [r], we have 


=books 


EVVIZ)= dso O¢ ™")(1 — p)@)-G), 


Asm ~ oo, this tends to 0; this follows from the expression for r in terms 
of m. Hence E(x ) is asymptotic to (” ya - p)®), The expression for r in terms 
of m yields E(X) € Q(m*/), Thus E(X)/m — oo, which completes the proof. @ 


8.5.39. Corollary. (Bollobas [1988]) For constant edge probability p = 1—1/c, 
almost every G? satisfies 


(1+ €)n/(2log.n) < x(G?) < (1+ €’)n/(2log.n), 
where € = log, log.n/log.n and e’ = 5log. log. n/log,n. 


Proof: The lower bound holds because almost every G? has no stable set larger 
than 2log.n ~ 2log, log.n. The upper bound follows from Theorem 8.5.38, be- 
cause we can almost always select stable sets of size 2 log. n — 5log, log. n until 
we have only n/ lg” n vertices left. Since n/ lg” n € o(n/ log.n), we can complete 
the coloring by using distinct new colors on the remaining vertices. a 


EXERCISES 


8.5.1. (—) Expectation. 
a) Compute the expected number of fixed points in a random permutation of [n]. 
b) Determine the expected number of vertices of degree k in a random n-vertex 
graph with edge probability p. 


8.5.2. (—) Prove that 1 — p < e~? for p> 0. 
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8.5.3. (—) Determine the expected number of monochromatic triangles in a random 
2-coloring of E( Ke). 


8.5.4. (—) Prove that some 2-coloring of the edges of K,,., has at least (")(")2t-rs 
monochromatic copies of K,.. 


8.5.5. (—) The statement “f(G,,) < (1+ ¢)n” means that for all « > 0, the inequality 
holds for sufficiently large n. The statement “f(G,,) < n+o(n)” means that f(G,)/n > 1 
as n — oo. Prove that these two statements are equivalent. 


8.5.6. Compute explicitly the probability that the Hamiltonian closure of a random 
graph with vertex set [5] is complete. 


8.5.7. Let G be a graph with p vertices, g edges, and automorphism group of size s. Let 
n = (sk?~1)1/?, Prove that some k-coloring of E(K,) has no monochromatic copy of G. 
(Chvatal—Harary [1973]) 


8.5.8. (!) a) Use a random partition of the vertices to prove that every graph has a 
bipartite subgraph with at least halfits edges. 

b) Use equipartitions of the vertices to improve part (a): if G has m edges and n 
vertices, then G has a bipartite subgraph with at least m aoe ee edges. 
8.5.9. An army of computers is configured as a complete k-ary tree with leaves at 
distance / from the root. At a fixed time, each node is working with probability p, in- 
dependently of other nodes. When a node is not working, the entire subtree below it is 
inaccessible. What is the expected number-of nodes accessible from the root? 


8.5.10. Let G be a matching of size n. Select a set of k vertices at random. Compute the 
expected number of edges induced by the selected vertices. 


8.5.11. Consider a drawing in the plane of a simple graph G with n vertices and m 
edges, where m > 4n. Let H be a random induced subdrawing, generated by letting 
each vertex be retained with probability p, independently. Let Y be the number of 
edge crossings in H. Let X = Y — [e(H) — (8n(A) — 6)]. Use expectations to prove 
that 3n + p?v(G) — pm > 0, and conclude that v(G) > m°/[64n7], where v(G) is the 
minimum number of crossings in a drawing of G. (Comment: This is an alternative 
proof of Theorem 6.3.16.) 


8.5.12. Given a random permutation of the vertices of a simple graph G, orient each 
edge toward the vertex with higher index in the permutation. Compute the expected 
number of sink vertices (outdegree 0) in the resulting orientation. In terms of n(G), de- 
termine the minimum and maximum values of this expectation. Prove that the probably 
of having only one sink is at most e(G)/ es . (Jeurissen [1997]) 


8.5.13. (!) A hypergraph consists of a collection of vertices and a collection of edges; if 
the vertex set is V, then the edges are subsets of V. The chromatic number x(#) ofa 
hypergraph H is the minimum number of colors needed to label the vertices so that no 
edge is monochromatic. A hypergraph is k-uniform if its edges all have size k. 

a) Prove that every k-uniform hypergraph with fewer than 2‘—! edges is 2-colorable. 
(Erdés [1963]) 

b) Use part (a) to prove that if each vertex of an n-vertex bipartite graph has a list 
of more than 1 + lgn usable colors, then a proper coloring can be chosen from the lists. 


8.5.14. (!) Use the deletion method to prove that a graph with n vertices and average 
degree d > 1 has an independent set with at least n/(2d) vertices. (Hint: Choose a 
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random subset by including each vertex independently with a probability p to be chosen 
later. Compute the expected number of edges induced.) 


8.5.15. The maximum size of an n-vertex graph not containing H is ex(n; H). Use the 
deletion method to prove that ex(n; C,) € Q(n'*/*-)), (Comment: One can also show 
that ex(n; C,) € O(n!*?/*) by considering the average degree.) (Bondy—Simonovits) 


8.5.16.. (!) For n € N, prove that R(k,k) > n- (")22-@), Use this to conclude that 
R(k, k) > (1/e)(1 — 0(1))k2*/?. ) 


8.5.17. For natural numbers n,t, let m =n — (yo Prove that there is a 2-coloring 
of the edges of K,,.,, with no monochromatic copy of K,,,. 


8.5.18. (+) Off-diagonal Ramsey numbers. Suppose that 0 < p < 1. 
a) Prove that if (") p® + (re! - p) (2) <1, then R(k, 1) >n. 
b) Prove that R(k, 1) > n — (") p@ = (ore! — p)@) for alln € N. 


c) Choose n and p in part (b) to prove that R(3, k) > k°/2-°, What lower bound on 
R(3, k) can be obtained from part (a)? (Spencer [1977]) 


8.5.19. Let H be a graph. For constant p, prove that almost every G” contains H as an 
induced subgraph. 


8.5.20. a) Fix k,s,t, p. Prove that almost every G’ has the following property: for 
every choice of disjoint vertex sets S, T of sizes s,t, there are at least k vertices that are 
adjacent to every vertex of S and no vertex-of T. (Blass—Harary [1979]) 

b) Conclude that almost every G” is k-connected. 

c) Apply the same argument to random tournaments: almost every one has the 
property that for every choice of disjoint vertex sets 5, T of sizes s,t, there are at least k 
vertices with edges to every vertex of S and from every vertex of T. 


8.5.21. Arandom labeled tournament is generated by orienting each edge v;v; as vj > 0; 
or v; — v; independently with probability 1/2. 

a) Prove that almost every tournament is strongly connected. 

b) In a tournament, a “king” is a vertex such that every other vertex can be reached 
from it by a path of length at most 2. It is known that every tournament contains a king. 
Is it true that in almost every tournament every vertex is a king? (Palmer [1985]) 


8.5.22. Find a threshold probability function for the property that at least half the 
possible edges of a graph are present. How sharp is the threshold? 


8.5.23. For p = 1/n and fixed € > 0, show that almost every G’ has no component with 
more than (1+ €)n/2 vertices. (Hint: Instead of trying to bound the probability directly, 
show that it is bounded by the probability of another event, which tends to 0.) 


8.5.24. Determine the smallest connected simple graph that is not balanced. 


8.5.25. Extend the second moment argument of Theorem 8.5.23 to prove that n~'/e() 
is a threshold function for the appearance of H as a subgraph of G’, where p(G) = 
maxgcy e(G)/n(G). (Bollobds [1981a], Ruciriski—Vince [1985]) 


8.5.26. Let @ be the following graph property: for every choice of disjoint vertex sets 
S,T of size clgn, there is an edge with endpoints in S and 7. Prove that almost every 
graph has property Q ifc > 2. (Comment: This implies that the random graph has 
bandwidth at least n — 2logn.) 
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8.5.27. Prove that ifk = lgn — (2+ €)lglgn, then almost every n-vertex tournament 
has the property that every set of k vertices has a common successor. 


8.5.28. A tournament is transitive if it has a vertex ordering u1,...,u, such that u; > 
u; if and only ifi < j. Prove that every tournament has a transitive subtournament 
with lgn vertices, and almost every tournament has no transitive subtournament with 
more than 2lgn + c vertices if c is a constant greater than 1. 


8.5.29. (!) The Coupon Collector. 

a) Consider repetitions of an experiment with independent success probability p. 
Prove that the expected number of the trial on which the first success occurs is 1/p. 

b) Every box of a certain type of candy contains one of n prizes, each with proba- 
bility 1/n. Receiving the grand prize requiries obtaining each of these prizes at least 
once. Prove that the expected number of the box on which the last prize is obtained is 
n>. 1/i. 

2 ¢) Prove that m(n) = nInn + (k — 1)nInInn is a threshold function for the num- 
ber of boxes needed to obtain at least k copies of each prize. (Hint: Prove that when 
p = 0(1) and k is constant, the probability of at most k successes in m trials with success 
probability p is asymptotic to the probability of exactly k successes.) 


8.5.30. Prove that the length of the longest run in a list of n random heads and tails is 
(1+ 0(1))lgn. In other words, for « > 0, almost no list has at least (1+¢) lgn consecutive 
identical flips, and almost every list has atleast (1 — €)lgn consecutive identical flips. 


8.5.31. With p = (1 —«)logn/n, find a large m such that.almost every graph has at 
least m isolated vertices. What m(n) results from Chebyshev’s Inequality? 


8.5.32. Given a graph G, say that ak-set § is bad if G has no vertex v such that § € N(v). 
For fixed p, how large can k be so that almost every G’ has no bad k-set? How slowly 
can k grow so that almost every G’ hasa bad k-set? 


8.5.33. By examining common neighbors, prove that if p is fixed and k = o(n/logn), 
then almost every G?’ is k-connected. 


8.5.34. (!) With p = (1 — €) logn/n, how large can m be such that almost every graph 
has at least m isolated vertices? (Hint: Use Chebyshev’s Inequality.) 


8.5.35. Ar-interval is a subset of R that is the union of at most t intervals. The interval 
number ofa graph G is the minimum ¢ such that G is an intersection graph of t-intervals 
(each vertex is assigned a set that is the union of at most ¢ intervals). Prove that al- 
most all graphs (edge probability 1/2) have interval number at least (1 — 0(1))n/(4lgn). 
(Hint: Compare the number of representations with the number of simple graphs. 
Comment: Scheinerman [1990] showed that almost all graphs have interval number 
(1 + 0(1))n/(21gn).) (Erdés—West [1985]) 


8.5.36. (!) Threshold for perfect matching in a random bipartite graph. Let G be a ran- 
dom subgraph of K,,, with partite sets A, B, generated by independent edge probability 
= (1+ ¢)Inn/n, where € is a nonzero constant. Call S a violated set if |N(S)| < |S|. 

a) Prove that if « < 0, then almost every G has no perfect matching. 

b) Let S be a minimal violated set. Prove that |N(S)| = |S|— 1 and that G[S U N(S)] 
is connected. 

c) Suppose that G has no perfect matching. Prove that A or B contains a violated 
set with at most [n/2] elements. 

d) For r,s > 1, the number of spanning trees of K,, is r°~1s’~!. Use this, part (b), 
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part (c), and Markov’s Inequality to prove that ife > 0, then G almost surely has a per- 
fect matching. (Hint: A summation in the bound on the expected number of minimal 
violated sets can be bounded by a geometric series.) 


8.5.37. Suppose that 0 < p < 1 and that kj,...,k, are nonnegative integers summing 
tom. Prove that [];_,[1-— (1 — p)*] <[1-(1— py"). 


8.5.38. Tail inequality for binomial distribution. JV.et X >. X;, where each X; is 
an indicator variable with success probability P(X; = 1) = .5, so E(X) = n/2. Ap- 
plying Markov’s Inequality to the random variable Z = (X — E(X))* yields P(|Z| > 
t) < Var(X)/t®. Setting t = a./n yields a bound on the tail probability: P(|X — np| > 
a./n) < 1/(2a7). Use Azuma’s Inequality to prove the stronger bound that P(|X — np| > 
aJn) < 2e-”. (Hint: Let Y/ = X, — .5. Let F; be the knowledge of Y;,..., Y’, and let 
Y¥; = E(Y|F;).) 


8.5.39. Bin-packing. Let the numbers S = {a,...,a,} be drawn uniformly and inde- 
pendently from the interval [0.1]. The numbers must be placed in bins, each having 
capacity 1. Let X be the number of bins needed. Use Lemma 8.5.36 to prove that 


P(|X — E(X)|) > An) < 2e-¥2, 


8.5.40. (!) Azuma’s Inequality and the Traveling Salesman Problem. 

a) Prove the generalization of Azuma’s Inequality to general martingales: If 
E(X;) = X;-1 and |X; — X;-1| < cj, then P(X, — Xo) = if > OS ene, 

b) Let Y be the distance from « given-point z in the unit square to the nearest of 
n points chosen uniformly and independently in the unit square. Prove that E(Y) < 
c/J/n, for some constant c. (Hint: For a nonnegative continuous random variable Y, 
E(Y) = i, P(Y > y)dy, which can be verified using integration by parts. In order to 
bound this integral, use (somewhere) the inequality 1—a < e~“ and the definite inte- 
gral [> e edt = /r/2.) ep 

c) Apply parts (a) and (b) to prove that the smallest length of a polygon bounding a 
random set of 7 points in the unit square is highly concentrated around its expectation. 
In particular, the probability that this deviates from the expected tour length by more 
than AcVInn is bounded by Qe 2 for some appropriate c. (Hint: For the martingale 
in which X, is the expected length of the tour when the first i points are known, prove 
that |X, — X;_1| < c(n —i)~/?. Lemma 8.5.36 does not apply directly.) 


8.6. Eigenvalues of Graphs 


Techniques from group theory and linear aiygebra assist in studying the 
structure and enumeration of graphs. 

From linear algebra, we have seen hints of vector spaces and determinants. 
In a graph G with edges e}..... e,,, the incidence vector for a set F C E(G) 
has courdinates a; = 1 when e; € F and a; = 0 when e; ¢ F. Let C be the set of 
incidence vectors of even subgraphs (those with all vertex degrees even), and 
let B be the set of incidence vectors of edge cuts. Because these sets are closed 
under binary vector addition, C and B are vector spaces (F'xercises 1~2), called 
the cycle space and bond space of G. Since an even subguaph and =.n edge cut 
share an even number of edges, C and B are orthogonal. This is clesciv related 


‘ 
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to the duality between cycles and bonds in Theorem 6.1.14 and Corollary 8.2.42 
and to the use of determinants in the Matrix Tree Theorem (Theorem 2.2.12). 
For further discussion of these vector spaces, see Biggs [1993, Part 1]. 

Groups arise in studying graph isomorphism, embeddings, and enumera- 
tion. The automorphisms of a graph form a group of permutations of its vertices. 
Group-theoretic ideas lead to algorithms for testing isomorphism and to con- 
structions for embedding on surfaces. Conversely, every group can be modeled 
using graphs. An introduction to this interplay appears in White [1973]; see 
also Gross—Yellen [1999, Chapters 13-15]. 

We restrict our attention to eigenvalues of adjacency matrices. We inter- 
pret the characteristic polynomial in terms of subgraphs, relate the eigenvalues 
to other graph parameters, and characterize the sets of eigenvalues for bipar- 
tite graphs and regular graphs. We close with applications to expander graphs 
and the “Friendship Theorem”. An encyclopedic discussion of graph eigen- 
values appears in Cvetkovic—Doob-Sachs [1979]. Chung [1997] presents the 
modern approach, modifying the adjacency matrix in a way that normalizes 
the eigenvalues and yields analogous results that hold more generally. For our 
brief presentation, we use the classical version. 


THE CHARACTERISTIC POLYNOMIAL 


math_ 


8.6.1. Definition. The eigenvalues ofa matrix A are the numbers A such that 
Ax = Ax has a nonzero solution vector; each such solution is an eigenvec- 
tor associated with A. The eigenvalues of a graph are the eigenvalues of 
its adjacency matrix A. These are the roots Aj...., i, of the character- 
istic polynomial ¢(G:; A) = det(AJ — A) = [];_,( — 4;). The spectrum 
is the list of distinct eigenvalues with their multiplicities m,,...,m,; we 
write Spec (G) = (/"""""). 

8.6.2. Remark. Elementary properties of eigenvalues. 

0) The eigenvalues are the values A such that the square matrix AJ — A is 
singular, which is equivalent to det(AJ — A) = 

1) }/ A; = Trace A. The trace is the sum of the diagonal elements and is 
the coefficient of 4”~1 in det(AJ — A). Since det(AJ — A) = []/_,( — Aj), that 

coefficient is also )'A;. For simple graphs, it is 0. 

2) [[a; = (-1)"@(G; 0) = detA = >, sign(o) []}_, ac), where the sum 

runs over permutations o of [n]. 

3) For a symmetric real n-by-n matrix A and A € R, the multiplicity of 1 as 

an eigenvalue of A is n — rank(Al — A). 

4) Adding c to the diagonal shifts the eigenvalues by c, since a + c is a root 

of det(A/ — (ci + A)) if and only if @ is a root of det(A/ — A). 


8.6.3. Example. Spectra of cliques and bicliques. The adjacency matrix of K,, 
is J — J, where J is the matrix of all 1s. Hence the eigenvalues of K,, are 1 less 


than fate of J. Since Spec J = (/ ’ ), we have Spec K, = ea . i 


1 n-1/? 
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The adjacency matrix of K,,, has rank 2, so it has two nonzero eigenval- 
ues Aj, Ag. The trace is 0, so Ay = —A9; call this constant b. Hence $(K,,,, A) = 
X" — b?a"-2_ We compute b using $(G; A) = det(A/J — A). Since 4 appears only 
on the diagonal, contributions in the permutation expansion to the coefficient 
of X"-? arise only from permutations that use n — 2 positions on the diago- 
nal. The remaining two positions must be —a, ; and —a,; for some i, j. There 
are mn nonzero contributions of this form, all negative. Hence b? = mn, and 
Spec (Kinn) = ore eels a 

We index the coefficients of the characteristic polynomial so that ¢(G; A) = 
Yo-9 iA" !. Since $(G; A) = det(AJ — A), always cp = 1 and c; = —Trace A = 0. 
Our computation of co for Km, extends to all graphs. 


8.6.4. Definition. A principal submatrix of a square matrix A is a submatrix 
selecting rows and columns with the same indices. 


Since contributions to c2A”~* involve n — 2 factors of A from the diagonal, 
the coefficient cz is the sum of the principal 2 x 2 subdeterminants of —A. For 
a simple graph, —a;,; is —1 when v; + v; and 0 otherwise, so cp = —e(G). 

Similarly, cz is the sum of the principal 3 x 3 subdeterminants of —A. 
For triple i, j,k, the determinant depends only on the number of edges among 
vj, vj, y%. The determinant is 0 unless-they form a triangle, and then it is —2. 
Hence c3 is —2 times the number of 3-cycles in G. 

Since principal submatrices are the adjacency matrices of induced sub- 
graphs, in general we have c; = (—1)' ¥)),,_; det A(GIS]). 


8.6.5. Theorem. (Harary [1962b]) Given a simple graph G, let H be the set of 
spanning subgraphs in which every component is an edge or acycle. Ifk(H) 
and s(H) denote the number of components of H and the number of compo- 
nents that are cycles, respectively, then det A(G) = )°yey(—D" M2) 


Proof: The determinant formula is det A = >-, (—1)' [] ai.cc), where the sum 
is over permutations of [n] and t(o) is the number of row exchanges (transposi- 
tions) needed to put the positions i, o(i) on the diagonal. When A is a 0,1-matrix, 
the contribution from o is nonzero if and only if these entries all equal 1. 

We view such ao as a vertex permutation mapping each vu; to v,,;). This 
partitions V(G) into orbits. Since a;.,4) = 1 means v; @ v,_), there are no orbits 
of size 1, orbits of size 2 correspond to edges, and longer orbits correspond to 
cycles. Thus the permutation makes a nonzero contribution when it describes 
a spanning subgraph H of G in which the components are edges and cycles. 

The sign of the contribution is determined by the number of transpositions 
needed to move the entries to the diagonal. Row exchanges move one element 
of an orbit at a time to the diagonal, but the last switch moves the last two 
elements to the diagonal. Hence t(o) = n(H) —k(H). Finally, each cycle of 
length at least 3 in H can appear in one of two ways in the permutation matrix, 
since we can follow the cycle in one of two directions. Hence the number of 
permutations that give rise to H is 2°”). a 
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8.6.6. Corollary. (Sachs [1967]) Let H; denote the collection of i-vertex sub- 
graphs of a simple graph G whose components are edges or cycles. The 
characteristic polynomial of G is }°c;A""', where c; = )_,, cH (-1)\" get). 


Proof: This follows from Theorem 8.6.5 and the earlier observation that c; = 
(-D! Yjsj; det A(GISI). 7 


This formula leads to a recursive expression for the characteristic polyno- 
mial (Exercise 5). The formula can be used to construct nonisomorphic trees 
with the same characteristic polynomial (and only eight vertices) (Exercise 7). 

We next discuss the properties of eigenvalues for bipartite graphs. 


8.6.7. Proposition. The (i, j)th entry of A* counts the v;, v;-walks of length k. 
The eigenvalues of A‘ are the kth powers of the eigenvalues of A. 


Proof: The statement about walks holds easily by induction on k (Exercise 
1.2.30). For the second statement, Ax = Ax implies A‘x = A‘x, by repeated 
multiplication. Using an arbitrary eigenvector x ensures that the multiplicities 
of the eigenvalues don’t change. | 


8.6.8. Lemma. If G is bipartite and A is an eigenvalue of G with multiplicity 
m, then —d is also an eigenvalue with multiplicity m. 


Proof: Adding isolated vertices to give the partite sets equal size merely adds 
rows and columns of 0’s to the adjacency matrix, which does not change the 
rank and hence changes the spectrum only by including one extra 0 for each 
vertex added. Hence we may assume that the partite sets have equal sizes. 

Since G is bipartite, we can permute the rows and columns of A to bb- 
tain the form A = ( a an where B is square. If A is an eigenvalue associated 
with eigenvector v = (*) (partitioned according to the bipartition of G), then 
hu = Av= ee 0) (.) = (nr): Hence By = Ax and B’x = Ay. 

Let v’ = (“,). We compute Av’ = ("7") = (5) = -Av’. Hence v’ is an 
eigenvector of A for the eigenvalue —\. Furthermore, m independent eigenvec- 
tors for 4 yield m independent eigenvectors for —A in this way. Hence —A is an 


eigenvector of A with the same multiplicity as i. a 


8.6.9. Theorem. The following are equivalent statements about a graph G. 

A) G is bipartite. 

B) The eigenvalues of G occur in pairs );, A; such that 4; = —A,. 

C) (G; A) is a polynomial in 2. 

D) 37j_, 47"! = 0 for any positive integer r. 
Proof: We proved A => B in the lemma. 

BSc: (A —A;)(A —A,;) = (A? — a) if and only if A; = —A;. Hence the roots 
occur in such pairs if and only if ¢(G; A) is a product of linear factors in A?. 

B=>D: Ifa; = —A;, then ia = a 

D=>A: Because )~ Af counts the closed k-walks in the graph (from each 
starting vertex), condition D forbids closed walks of odd length. This forbids 
odd cycles, since an odd cycle is an odd closed walk, and hence G is bipartite. & 
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LINEAR ALGEBRA OF REAL SYMMETRIC MATRICES 


Relating eigenvalues to other parameters requires several results from lin- 
ear algebra, including the Spectral Theorem and Cayley—Hamilton Theorem for 
real symmetric matrices. These are usually stated in more generality, but ad- 
jacency matrices are real and symmetric, and here the theorems have shorter 
proofs. We begin with a lemma that follows from the Spectral Theorem when 
the latter is proved using complex matrices. The proofs of these results may be 
skipped, especially by readers well-versed in linear algebra. 


8.6.10. Lemma. If f(x) = x’ Ax, where A is a real symmetric matrix, then f 
attains its maximum and minimum over unit vectors x at eigenvectors of 
A, where it equals the corresponding eigenvalues. 


Proof: The function f is continuousin x;,...,x,. For constrained optimization, 
we use Lagrangian multipliers. Given the constraint x’x = 1, we let g(x) = 
x’x —1. Forming L(x,A) = f(x) — Ag(x), the extreme values occur where all 
partial derivatives of L are 0. With respect to A, this yields x7x = 1. 

Let V denote the vector of partial derivatives with respect to x1,...,x,. We 
compute VL(x,A) = Vf(x) —AVg(x) = 2Ax — 2Ax. The statement V f(x) = 
2Ax uses the symmetry of A. We have-VL = 0 precisely when Ax = Ax, which 
nequires x to be an eigenvector of A for eigenvalue X. This yields f(x) = Ane I 
Axl x =A, a 


me/math_ 


Since our variables in the optimization are real, we have found at least one 
real eigenvector and eigenvalue. We can use this inductively to show that all 
eigenvectors have this property. 


tele 


8.6.11. Theorem. (Spectral Theorem) A real symmetric n x n matrix has real 
eigenvalues and n orthonormal eigenvectors. 


Proof: We use induction on n. The claim is trivial for n = 1; considern > 1. Let 
v, be the eigenvector maximizing x’ Ax. Let W be the orthogonal complement 
of the space epanned by v,; it has dimension n — 1. If w € W, then v7 Aw = 
w' Av, = A,w’v, = 0. Hence Aw € W. Viewing multiplication by A asa 
mapping f,, we have f,: W > W. 

Let S be a matrix whose columns are the vectors of an orthonormal basis 
of R' with v, as the last column. Since the basis is orthonormal, S~! = $7. The 
matrix for f, with respect to this basis is S’ AS. Since the basis is orthonormal 
and v, is an eigenvector, the last column of S7 AS is 0, except for ,, in the last 
position. Furthermore, the matrix is symmetric. Hence its first n — 1 rows and 
columns form the matrix A’ for f, on W with respect to this basis. 

By the induction hypothesis, A’ has orthonormal eigenvectors vj, ..., U,—1, 
with real eigenvalues. Using S, we convert these back into real eigenvectors for 
A. They have the same real eigenvalues, and they form an orthonormal set. @ 


_N ext we consider oe functions of a matrix. Viewed as peng of 
*, the matrices /, A, A2,..., A” cannot be independent, since there are n? + 1 
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of them. Using an equation of linear dependence, we obtain a polynomial p 
such that p(A) is the zero matrix. The characteristic polynomial itself suffices. 
This holds for all A, but again we consider only real symmetric matrices. 


8.6.12. Theorem. (Cayley—Hamilton Theorem) If ¢(A) is the characteristic 
polynomial of a real symmetric matrix A, then ¢(A) is the zero matrix (A 
“satisfies” its own characteristic polynomial). 


Proof: Let the eigenvalues of A be Aj,...,An, 80 (A) = [];_,( — Aj). Since 
powers of A commute, the matrix polynomial obtained by using A for 4 factors 
as $(A) = [];_,(A —Ai/). To prove that ¢(A) = 0, we need only show that the 
matrix ¢(A) maps every vector to 0. Write an arbitrary vector x as a linear 
combination of the basis of eigenvectors guaranteed by the Spectral Theorem. 
Applying A — A;/ kills the coefficient of v;. Successively multiplying by all the 
factors A — A;/ produces the zero vector. a 


8.6.13. Definition. The minimum polynomial y of a matrix A is the poly- 
nomial of minimum degree satisfied by A and having leading coefficient 1. 
When A is the adjacency matrix of G, we call this the minimum polyno- 
mial ¥(G; A) of G. 


Y 


a) 


The minimum polynomial is unique: if A satisfies two such polynomials of 
the same degree, then A satisfies their-difference, which has lower degree. 
8.6.14. Theorem. The minimum polynomial of A is ¥(A) = [],_, A—A;), where 

{A1,..-,A,} are the distinct eigenvalues of A. 


Proof: The minimum polynomial divides every polynomial satisfied by A, since 
otherwise the remainder would be a polynomial of lower degree satisfied by 
A. The Cayley—-Hamilton Theorem now implies that yw divides ¢ and must 
be the product of some of its factors. Killing the vectors in the subspace of 
eigenvectors for eigenvalue A; requires a factor of the form A —A;/J. This factor 
kills all vectors in that subspace, so we only need one copy of each such factor. @ 


8.6.15. Lemma. (Sylvester’s Law of Inertia) Let A be a real symmetric matrix. 
If x7 Ax can be written as asum of N products of linear expressions, that is 
nt AX = iO ee GimXi)(> jer, 9j.mx;), then N is at least the maximum 
of the number of positive and the number of negative eigenvalues of A. 

Proof: (Tverberg [1982]) Write the linear expressions as u,,(x) and v,,(x). For 

each m, we have uy,(x)Um(x) = L2(x) — M2(x), where L = 5(u+v) and M = 

3(u — v) are also linear combinations of x,,..., x,. This expresses the quadratic 

form as x? Ax = )~*_,[L2(x) — M2(x)]. 

On the other hand, A is a real symmetric matrix and thus has orthonormal 


eigenvectors w!,..., w”. Using this, we write x7 Ax = x7 SAS’ x, where A is the 
diagonal matrix of eigenvalues A; > --- > 4, and S hascolumns w!,..., w”. If S 
has p positive and q negative eigenvalues, then this becomes x’ Ax = )77_,(y' - 
x)? — yg 41 (2! + x)”, where each y’ or z’ is \A; {2/2 wi. 
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Now we consider a homogeneous system of linear equations. We require 
L»(x) = Ofor1 < m < N, also z'-x = 0 forn—g <i<n,andw'-x =Oforp <i < 
n—q. This places N +n — p homogeneous linear constraints on 7 variables. If 
N < p, then these equations have a nonzero simultaneous solution x’. Setting 
x to x’ in the two expressions for x’ Ax yields -?_,(y' - x’)? = —°™_, M2(x’). 
Since x’ is orthogonal to all eigenvectors with nonpositive eigenvalues, the left 
side is positive, while the right is nonpositive. The contradiction yields N > p; 
an analogous argument yields N > q. a 


EIGENVALUES AND GRAPH PARAMETERS 


Eigenvalues provide bounds on various parameters, or alternatively graph 
parameters yield bounds on the eigenvalues. Our first result uses only the 
minimum polynomial. 


8.6.16. Theorem. The diameter ofa graph G is less than the number of distinct 
eigenvalues of G. 


Proof: Let A be the adjacency matrix; A satisfies a polynomial of degree r if and 
only if some linear combination of A®, 3 ., A’ is 0. Since the number of distinct 
eigenvalues is the degree of the minimum polynomial, we need only show that 
A°,..., AX are linearly independent when k < diam (G). 

It suffices to show for k < diam (G) that A* is not a linear combination of 
A®,..., AX, Choose v;, uj € V(G) such that d(v;, vj) = k. By counting walks, 
we lave Ax, Z Obut Aj , = Oforr <k. Therefore, A‘ is not a linear combination 
of the gmeller powers. a 


Since the Spectral Theorem guarantees real eigenvalues, we can index our 
eigenvalues as A; >--- >A,. We also refer to A; and A, aS Amax(G) and Apin(G). 


8.6.17. Lemma. If G’ is an induced subgraph of G, then 
Amin(G) = Amin(G’) = Amax(G’) = Amax(G) . 


Proof: Since A is a real symmetric matrix, Lemma 8.6.10 yields Amin(A) < 
x? Ax < Amax(A) for every unit vector x. Consider the adjacency matrix A’ of 
G’. By permuting the vertices of G, we can view A’ as an upper left principal 
submatrix of A = A(G). Let z’ be the unit eigenvector of A’ such that A’z’ = 
Amax(G’)z’. Let z be the unit vector in R, obtained by appending zeros to 2’. 
Then Amax(G’) = 27 A’z’ = 27 Az < Amax(G). Similarly, Amin(G’) > Amin(G). a 


The behavior of the extreme eigenvalues under vertex deletion is a special 
case of the “Interlacing Theorem”: If G has eigenvalues A; >--- >A, and G—x 
has eigenvalues 4; > --- > u,_1, then A > wy > Ag >--- > My_-1 > An. We will 
not need this and hence omit the proof, which involves only linear algebra. 
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8.6.18. Lemma. For every graph G, 5(G) < a) < hmax(G) < A(G). 


Proof: Let x be an eigenvector for eigenvalue i, and let x; = max; x; be the 
largest coordinate value in x. Then A < A(G) follows from 


Ax; = (Ax); = > xi< d(v;)x; < A(G)x; : 


vj EN(v,) 


For the lower bound, we apply Lemma 8.6.10 to the unit vector with equal 
coordinates. Since the sum of the entries in the adjacency matrix is twice the 
number of edges of G, we have 


Nae > tat == ya, = ase ml 


Lemma 8.6.18 enables us to improve the trivial bound x(G) < 1+ A(G) 
given by the greedy coloring algorithm. Replacing A(G) with the average de- 
gree is too small; K, + K; has chromatic number n and average degree less 
than n — 1. Since Amax is always at least the average degree, 1 + Amax(G) has a 
chance to work and can’t be much improved. 


8.6.19. Theorem. (Wilf [1967]) For every graph G, x(G) < 1+ Amax(G). 


Proof: If ¥(G) = k, then we can successively delete vertices without reducing 
the chromatic number until we obtain a subgraph H such that x(H —v) =k-1 
for all v € V(H). As observed in Lemma 5.1.18, 5(H) > k — 1. Since H is an 
induced subgraph of G, Lemma 8.6.18 and then Lemma 8.6.17 yield 


k <148(H) <14Agix(H) < 1+ Amax(G). . 


Sylvester’s Law of Inertia yields.a lower bound on the number of bicliques 
needed to decompose a graph. Because stars are bicliques and every subgraph 
of a star is a star, the number of bicliques needed is at most the vertex cover 
number 6(G) = n(G) — a(G). Erdés conjectured that equality almost always 
holds, but this remains open. Graphs with special structure may have efficient 
partitions using other bicliques. The general lower bound using eigenvalues 
appears explicitly in Reznick—Tiwari—West [1985], but it is implicit in earlier 
work decomposing the complete graph (Tverberg [1982], Peck [1984]). 


8.6.20. Theorem. For a simple graph G, the number of bicliques needed to de- 
compose G is at least the maximum of the number of positive and number 
of negative eigenvalues of the adjacency matrix A(G). 
Proof: When G decomposes into subgraphs G;,..., G;, we may write A(G) = 
>_}-1 Bi, where B; is the adjacency matrix of the spanning subgraph of G with 
edge set E(G;). When G; is the biclique with bipartition S;, T;, we have x’ Bjx = 
2 Does x; )oxer, Xk- Writing these linear expressions as u;(x) = J/2 Des x; and 
u(x) = Dyer, Xe, we have x?Ax = \Yi_, x" Bix = Y\i_1uj(x)u;(x). Sylvester’s 
Law of Inertia (Lemma 8.6.15) now yields the claimed lower bound. | 
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8.6.21. Example. Biclique decomposition of C(2;41),0C,. There are simple for- 
mulas for the eigenvalues of a cycle (Exercise 6) and for computing the eigen- 
values of a cartesian product from the eigenvalues of the factors (Exercise 10). 
These yield simple formulas for the numbers of positive and negative eigenval- 
ues of C,,0C, when m is an odd multiple of n. In particular, C(2;41), OC, has 
(2t+1)(n?+1)/2 positive eigenvalues and (2t-+1)(n?—1)/2 negative eigenvalues 
when n is odd (0 is not an eigenvalue). 

Furthermore, such a product decomposes into (2¢ + 1)(n? + 1)/2 bicliques, 
consisting of (2 + 1)(n — 1)/2 4-cycles and (21 + 1)(n + 1)/2 stars (Kratzke- 
West). Note that 4-cycles and stars are the only subgraphs of C,, OC, that are 
bicliques. The optimal decomposition of Ci; 0 Cs appears below. Edges wrap 
around from top to: bottom and right to left, and all grid points indicate vertices. 
The heavy dots indicate vertices that are centers of stars in the decomposition, 
and the circles indicate 4-cycles in the decomposition. a 


~ 
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EIGENVALUES OF REGULAR GRAPHS 


Like bipartite graphs, regular graphs can be characterized using spectra. 
The n-vector 1, with all coordinates 1 plays a special role in this and many 
other arguments involving eigenvalues, as does the matrix J of all 1s. 


8.6.22. Theorem. The eigenvalue of G with largest absolute value is A(G) if 
and only if some component of G is A(G)-regular. The multiplicity of A(G) 
as an eigenvalue is the number of A(G)-regular components. 


Proof: Let A be the adjacency matrix. The ith entry of Al, is d(v;). When Gis 
k-regular, we obtain Al, = k1,, and thus k is an eigenvalue with eigenvector 1,. 
In general, let x be an eigenvector for eigenvalue A, and let x; be a coordinate 
of largest absolute value among coordinates of x corresponding to the vertices 
of some component H of G. For the jth coordinate of Ax, we have 


al x] = [An] =| D0 af say) [xy] s A@ [x] . 
v, EN(v;) 
Hence |A| < A(G). Equality requires d(v;) = A(G) and x; = x; for all 
v; € N(v;). We can iterate this argument to reach all coordinates for vertices in 
H. Hence the eigenvalue associated with x has absolute value as large as A(G) 
only if H is A(G)-regular. 
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Thus the eigenvalue associated with an eigenvector x has absolute value 
as large as A(G) if and only if (1) each component of G containing a vertex 
where x is nonzero is A(G)-regular, and (2) x is constant on the coordinates cor- 
responding to each such component. We can choose the constant independently 
for each A(G)-regular component, so the dimension of the space of eigenvectors 
associated with A(G) is the number of A(G)-regular components. | 


When G is connected and not regular, it remains true that eigenvalues of 
largest absolute value have multiplicity 1 and that coordinates of the associated 
eigenvector have the same sign. This is related to the Perron—Frobenius The- 
orem of linear algebra and uses arguments like those above; we omit the proof. 

Powers of the adjacency matrix yield another characterization. 


8.6.23. Theorem. (Hoffman [1963]) A graph G is regular and connected if and 
only if J is a linear combination of powers of A(G). 


Proof: Sufficiency. If J can be so expressed, then for each i, j we have (A“);; 4 0 
for some k > 0, which requires a v,;, v;-walk of length k. Hence G is connected. 
For regularity, consider the matrices JA and AJ. The i, jth position of AJ 
is d(v;) (constant on rows), and the i, jth position of JA is d(v;) (constant on 
columns). Since J is a linear combination of powers of A, each of which com- 
mutes with A, wehave JA=AJ. Thus the i i, jth position is both d(v;) and d(v,) 
and the eraphii is regular. g 
Necessity. Since G is k-regular, k is an eigenvalue, and the minimum poly- 
nomial is y(G: A) = (A — k)g(A) for some polynomial g. Since w(G; A) = 0, we 
have Ag(A) = kg(A). Hence each column of g(A) is an eigenvector of A with 
eigenvalue k. Since G is regular and connected, each such eigenvector is a 
multiple of 1,,. Hence the columns of g(A) are constant. However, g(A) is a lin- 
ear combination of powers of a symmetric matrix and therefore must itself be 
symmetric. Hence the columns are equal and g(A) is a multiple of J. a 


S 


When G is simple and regular, G is also regular, and the eigenvalues of G 
can be obtained from the eigenvalues of G. This rests on the matrix expression 
for complementation: A(G) = J — I — A(G). 


8.6.24. Lemma. ¢(G; A) = (—1)" det[(—A — 1) ~ A(G) + J]. 


Proof: Direct computation yields det(AJ — A(G)) = det(Al — (J —I — A)) = 
det{(A + 1)/ — J +A] =(-1)" det[(—A —1I1-A+ J]. a 


8.6.25. Theorem. If a simple graph G is k-rezular, then G and G have the same 
eigenvectors. The eigenvalue associated with 1, is kin G andn—k—1in 
G. Ifx #1, is an eigenvector of G for eigenvalue A of G, then its associated 
eigenvalue in G is —1 — A. 

Proof: Since G is n — k — 1-regular, 1, is an eigenvector for both G and G, with 

eigenvalue k for G and n — k — 1 for G. Let x be another eigenvector of G in an 

orthonormal basis of eigenvectors, and let A = A(G). Since 1, -x = 0, )> x; = 0. 

We compute Ax = Jx —x — Ax =O—x—-—Ax=(-1-A)x. a 
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This yields a lower bound on the smallest eigenvalue of a regular graph 
and another derivation of the spectrum of K,,. 


8.6.26. Corollary. For a k-regular simple graph, A, > k —n. 


Proof: If G is k-regular and A; >--->A,, then the eigenvalues of G are 
(n—k—1,-1—A,,...,-1—A2), by Theorems 8.6.22-8.6.25. In particular, 
n-k-12>-,,-1. | 


The eigenvalues of a connected regular simple graph G can be used to 
count its spanning trees. The eigenvalues need not be rational, yet the result 
t(G) is an integer. The Matrix Tree Theorem (Theorem 2.2.12) says that t(G) 
equals each minor of Q = D — A, where A is the adjacency matrix and D is 
the diagonal matrix of degrees. When G is k-regular, D = k/. Letting Adj Q 
denote the adjugate matrix of Q (the matrix of signed cofactors), the Matrix 
Tree Theorem is the statement that Adj Q = t(G)J. Using Cayley’s Formula 
(Theorem 2.2.3) for spanning trees in K,,, we have Adj (n] — J) =n"~?J. 


8.6.27. Lemma. Let D be the diagonal matrix of vertex degrees in a simple 
graph G, let A = A(G), and let Q = D — A. The number of spanning trees 
of G is t(G) = det(J + Q)/n?. 

Proof: Observe that J? = nJ, JQ =.0, and Adj(AB) = Adj (A) Adj(B). We 

apply this using J + Q and the matrix n/ — J that arises from K,. We have 


Adj (nI — J) Adj (J + Q) = Adj [(nI — J)(J + Q)] = Adj (nQ), 


since J? = nJ and JQ = 0. We have computed that Adj (nJ — J) =n"~?J. Also, 
Adj (nQ) = n"—! Adj Q for any matrix Q. Canceling common factors of n yields 
J Adj (J + Q) = nt(G)J. Multiplying both sides of this on the right by (J + Q) 
yields J(det(J + Q)/) =nt(G)nJ. Both sides are multiples of J, so the desired 
equality holds. |] 


ks 


DOO. 


| 


We can now compute t(G) from the eigenvalues if G is regular. (This anal- 
ysis extends to all graphs when the modified system of eigenvalues is used.) 


8.6.28. Theorem. If G is a k-regular connected simple n-vertex graph with 
spectrum (," °"), then 1(G) = n71'(G; k) = n71 TTj_o(k — Aj)”. 


1 mo---m, 

Proof: Since J + Q = J +kI — A, the determinant of J + @Q is the value at k 
of the characteristic polynomial of A — J. Since G is k-regular and connected, 
it has 1, as an eigenvector with eigenvalue k, and the other eigenvectors are 
orthogonal to 1,. Every such eigenvector of A is also an eigenvector of A — J, 
with the same eigenvalue, since (A — J)x = Ax — Jx = Ax = Ax. 

Also, 1,, is an eigenvector of A — J with eigenvalue k — n. This produces a 
full set of eigenvalues for A — J. Evaluating the characteristic polynomial at k 
yields det(J + Q) =n TTj-2(k —i,;). The product is ¢’(G; k), since ¢(G; A) has 
i —k as anon-repeated factor when G is k-regular and connected. By Lemma 
8.6.27, we obtain r(G) upon dividing by n?. a 
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The results in Lemma 8.6.24—Theorem 8.6.28 were extended to arbitrary 
(non-regular) graphs in Kelmans [1967b] (also Kelmans—Chelnokov [1974]) us- 
ing the eigenvalues of the Laplacian matrix of the graph. This is the matrix Q 
used above. Another method for counting spanning trees appears in Kelmans 
[1965, 1966], and another variation on the Matrix Tree Theorem appears in 
Hartsfield-Kelmans—Shen [1996]. 


EIGENVALUES AND EXPANDERS 


Many applications in computer science require “expander graphs”. Walters 
[1996] collects many definitions that have been used for such graphs. The basic 
notion of expansion is that all small sets should have large neighborhoods. The 
aim is to establish good connectivity properties without having many edges. 


8.6.29. Definition. An (n,k,c)-expander is an X,Y-bigraph G with |X| = 
|Y| =n such that A(G) < k and that |N(S)| > (1+ c(1 — |S|/n)) - |S| for 
every S C X with |S| < n/2. An (n, k, c)-magnifier is an n-vertex graph G 
such that A(G) < k and that |N(S) S| > c-|S| for every S C V(G) with 
|S| < n/2. An n-superconcentrator is an acyclic digraph with n sources 
and n sinks such that for every set-A of sources and every set B of |A| sinks, 
there are |A| disjoint A, B-paths. 
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Expanders appear in the parallel: sorting network of Ajtai, Komldés, and 
Szemerédi [1983]. The condition for expansion strengthens Hall’s Condition; 
we have not one matching but many. This facilitates using expanders to con- 
struct superconcentrators. Applications of superconcentrators are discussed 
in Alon [1986a]. The bound on maximum degree makes the number of edges 
linear in n, thereby limiting the cost of constructing the network. 

Probabilistic methods (Exercise 22) yield the existence of expanders (and 
superconcentrators) with large n and bounded average degree (Pinsker [1973]), 
Pippenger [1977], Chung [1978b]). Margulis [1973] used algebraic ideas to 
construct an explicit example (see also Gabber—Galil [1981]). 

Although an appropriately generated random graph will almost always 
have good expansion properties, it is hard to measure expansion. Tanner [1984] 
and Alon—Milman [1984, 1985] independently used eigenvalues to remedy this. 
They proved that graphs have good expansion properties when the two largest 
eigenvalues are far apart. Since eigenvalues are easy to compute (or approxi- 
mate), we can generate a graph randomly and then compute its eigenvalues to 
check the amount of expasion. 

We consider only the special case of regular graphs. Expanders are more 
useful than magnifiers in applications, but it is easy to obtain an (n, (k + 1), c)- 
expander from an (n,k, c)-magnifier (Exercise 21). Hence we consider the re- 
lationship between eigenvalues and magnification. Our presentation follows 
that of Alon—Spencer [1992, p119ffl, which discusses additional properties of 
the eigenvalues of regular (and random) graphs. 
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8.6.30. Theorem. If G is a k-regular n-vertex graph with second-largest eigen- 
value A, and S is a nonempty proper subset of V(G), then 


|[S, S]| => (k — A) |S| |S] /n. 


Proof: Since G is k-regular, Amax(G) = k. The claim is trivial if k — A = 0, so we 
may assume that G is connected. We compute 


xP(kI-—A)x =k) x? -2 S 7 xixy = Do Gi — x)’. 


ijeE(G) ijeE(G) 


Now let s = |S| and set x; = —(n —s) fori € S and x; = s fori ¢ S. The sum on 
the right above becomes n? |[S, S]]. 

Because |S| = s implies }* x; = 0, the vector x is orthogonal to the eigen- 
vector 1, of A with eigenvalue k. The eigenvector 1, is also the eigenvector of 
kI — A for its smallest eigenvalue 0. Using Lemma 8.6.10 and Theorem 8.6.11, 
the minimum of Eki Ae over vectors orthogonal to 1, is the next smallest 
eigenvalue of kJ — A, which is k — 4. Hence 


xT (kI — A)x > (k—A)x™x = (k—-A)(s(n—5)? +(n—5)s”) = (K—-A)s(n—)n. 
Since x7 (kJ — A)x =n 


, we have e |IS. SI] > (k —i)s(n — s)/n. a 


book 
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8.6.31. Corollary. If G is ak-regular n-vertex graph with second-largest eigen- 
value A, then G is an (n, k, c)- magnifier, where c = (k — A)/2k. 


Proof: If S is a set of s < n/2 vertices inG, then Theorem 8.6.30 yields [S, S]| > | > 

k — A)s(n — s)/n. Each vertex of S receives at most k of these edges, so S must 
have at least (k — A)s(n — s)/(nk) neighbors i in S. Since (n — s)/n > 1/2, the 
result follows. a 


Greater separation between the two largest eigenvalues yields greater 
magnification. Alon and Milman [1984] improved the lower bound to c > 
(2k —2))/(3k —2A). Alon [1986b] proved a partial converse: If a k-regular graph 
G is an (n, k, c)-magnifier, then the separation k — A is at least c?/(4 + 2c?). 

Explicit constructions of regular graphs are known with separation be- 
tween A; and A2 nearly as large as possible. The second largest eigenvalue of 
a k-regular graph with diameter d is at least 2/k — 1(1 — O(1/d)) (see Nilli 
[1991]). Lubotzky—Phillips-Sarnak [1986] and Margulis [1988] constructed in- 
finite families of regular graphs where, the degree k is 1 more than a prime 
congruent to 1 mod 4 and the second largest eigenvalue is at most 2k — 1. 


STRONGLY REGULAR GRAPHS 


We close with an application to a special class of regular graphs. 


8.6.32. Definition. A simple n-vertex graph G is strongly regular if there are 
parameters k, A, u such that G is k-regular, every adjacent pair of vertices 
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have A common neighbors, and every nonadjacent pair of vertices have u 
common neighbors. 


Properties of eigenvalues of strongly regular graphs provide a short proof 
of a curious result called the “Friendship Theorem”. This theorem says that at 
any party at which every pair of people have exactly one common acquaintance, 
there is one person who knows everyone (presumably the host). The resulting 
graph of the acquaintance relation consists of some number of triangles shar- 
ing a vertex. Another motivation for studying strongly regular graphs is their 
connection with the theory of designs. Strongly regular graphs with 4 = p cor- 
respond to symmetric balanced incomplete block designs. Other regular graphs 
with rich algebraic structure appear in Biggs [1993, part 3]. 


8.6.33. Theorem. If G is a strongly regular graph with n vertices and param- 
eters k, A, u, then G is strongly regular with parameters k’ = n —k — 1, 
N=n—2-—2k+ uy, and p’ =n — 2k 4+). 

Proof: For each adjacent pair v ~ w in G, there are 2(k — 1) — A other vertices 

in N(v) U N(w), so v and w have n — 2 — 2(k — 1) +A common nonneighbors. 

When v # w there are 2k — yw vertices in N(v) U N(w) and thus n — 2k+y 

common nonneighbors. a 


th_ books 


8.6.34. Theorem. If G is a strongly réalar graph with n vertices and param- 
eters k, iA, uw, then k(k —A—-1) = un —k-—1). 


Proof: We count induced copies of P3 with a fixed vertex v as an endpoint. The 
middle vertex w can be picked in k ways. For each such w, the third vertex can 
be any neighbor of w not adjacent to v. With v unavailable, there are always 
k —} — 1 ways to pick the third vertex. On the other hand, the third vertex can 
be picked in n — k — 1 ways as a nonneighbor of v, and for each such choice there 
are . common neighbors with v that can serve as w. | 


8.6.35. Example. Degenerate cases: uw = Oord} =k—lork =n-—1. We 
show that such a strongly regular graph is a disjoint union of k + 1-cliques. By 
Theorem 8.6.34, A = k — 1 if and only if 1 = 0 ork =n—1. Hence we may 
assume that 1 = k — 1. Now every neighbor of v is adjacent to every other, 
which forbids an induced Ps and forces G to be a disjoint union of cliques. SH 


Henceforth, we assume that uw > 0 anda < k — 1. Theorem 8.6.34 states 
a necessary condition on the set of parameters for a strongly regular graph. 


Another necessary condition arises from the eigenvalues. 


8.6.36. Theorem. (Integrality Condition) If G is strongly regular with n ver- 
tices and parameters k, A, , then the two numbers below are nonnegative 


integers. 
Lf yg = Du — A) = 2k 
2 (uw — A)? + 4(k — p) 
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Proof: These are nonnegative integers because they are multiplicities of eigen- 

values. Consider A”. The ijth entry of A? isk ifi = j, is A if vy; @ v;, and 

is w if v1 ~ v;. Since v; < v,; marks the 1s in the adjacency matrix and 

uv; # v; marks the 1s in the adjacency matrix of the complement, we have A? = 

kI+4A+ (J —1—A). Rearranging terms yields A? = (k— pw) +(—p)A+pJ. 
Multiplying 1, by both expressions for A? yields 


k*1, = (k—p)i, + — w)k1, + unl, 


which yields another proof of k(k —A—1) = w(n—k—1). Let x be an eigenvector 
for another eigenvalue 6 # k. Since x is orthogonal to 1,, we have Jx = 0,. 
Multiplying x by both expressions for A? produces 6? — (A — )0 — (k — nw) = 0. 
This quadratic equation for 6 has two roots r, s, which must be the values of all 


the other eigenvalues. The values are 5 (a —ptVJ/A— pw)? +4(k - 1). 


Now let a and b be the multiplicities of the eigenvalues r and s. Example 
8.6.35 describes all cases when » = 0. Hence we may assume that pw > 0, and 
thus G is connected. Because G is connected, eigenvalue k has multiplicity 1, 
and we have 1+a+b =n. Since the eigenvalues sum to 0, we havek+ra+sb = 
0. The solution to these two linear equations for a and b is a = aE and 


b = ‘#@-)) | These are the values claimed above to be nonnegative integers. & 


) 


OC 


Px 


The argument above can also be traced in the opposite direction. 
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8.6.37. Theorem. A k-regular connected graph G is strongly regular with pa- 
rameters k, i, u if and only if it has exactly three eigenvalues k > r > s 
and these satisfy r-+s =A—pandrs = —(k— yp). a 


8.6.38. Example. Classes of strongly regular graphs. We consider two cases: 
(n — 1)(u — A) = 2k and (n — 1)(u — A) F 2k. Excluding the trivial values, the 
first case requires up = 4+ 1, because 0 < 2k < 2n — 2. By Theorem 8.6.33, 
G and G are thus strongly regular graphs with the same parameters. In this 
case, we also know that n = 44+ 1 and that n is the sum of two perfect squares. 
Furthermore, the eigenvalues r and s have the same multiplicity. 

In the second case, rationality requires (4 — 4)? + 4(k — 4) = d? for some 
positive integer d, and d must divide (n — 1)(u — A) — 2k. Here the eigenvalues 
must be integers. Various such examples are known. In the special case 1 = 0 
and yu = 2, three such graphs are known, but it is not known whether the list is 
finite! The known examples, listing the parameters (n,k, A, 2), are the square 
(4, 2, 0, 2), the Clebsch graph (16, 5, 0, 2), and the Gewirtz graph (56, 10, 0, 2) 
(see Cameron—van Lint [1991], p43). The Clebsch graph arises in Exercise 23. 
Other strongly regular graphs appear in Exercises 24-26. | 


Finally, we prove the Friendship Theorem. It is startling that such a 
combinatorial-sounding result seems to have no short combinatorial proof. 
There do exist proofs avoiding eigenvalues (see Hammersley [1983]), but they 
require complicated numerical arguments to eliminate regular graphs. 
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8.6.39. Theorem. (Friendship Theorem—Wilf [1971]). If G is a graph in which 
any two distinct vertices have exactly one common neighbor, then G has a 
vertex joined to all others. 


Proof: The symmetry of the condition suggests that G might be regular. If 
G is regular, then it is strongly regular with 4 = 4 = 1. By Theorem 8.6.36, 
5(n —1+k/V/k — 1) now must be an integer. Hence k/./k — 1 is an integer, which 
happens only when k = 2. However, K3 is the only 2-regular graph satisfying 
the condition, and it does have vertices of degree n — 1. 

Now suppose that G is not regular. We show that v 4 w requires d(v) = 
d(w). Insistence on unique common neighbors forbids 4-cycles. Let u be the 
common neighbor of {v, w}. Let a be the common neighbor of {u, v}, and let b 
be the common neighbor of {u, w}. Every » €¢ S = N(v) — {u, a} has a common 
neighbor f(x) with w. If f(x) = b for some x € S, then x, b, u,v is a 4-cycle. If 
f(x) = f(’) for distinct +, x’ € S, then x, v,x’, f(x) is a 4-cycle. We have thus 
shown that d(w) > d(v). By symmetry, d(v) > d(w). 

Since G is not regular, it has two vertices v, w with d(w) 4 d(v). By the 
preceding paragraph, v ~ w. Let uw be their common neighbor. Since wv cannot 
have the same degree as each of them, we may assume that d(u) 4 d(v). If G 
has a vertex x ¢ N(v), then d(x) = d(v), but this requires x @ w and x @ u. 
This creates the 4-cycle v. u, x, w. Hence d(v) =n—1. | 
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EXERCISES 


8.6.1. Interpretation of cycle space and bond space. Given a graph G, prove that 
a) The symmetric difference of two even subgraphs is an even subgraph. 
b) The symmetric difference of two edge cuts is an edge cut, and 
c) Every edge cut shares an even number of edges with every even subgraph. 


8.6.2. Dimension of cycle space and bond space. By parts (a) and (b) of Exercise 8.6.1, 
the cycle space C and bond space B of a graph G are binary vector spaces. Prove that 
when G is connected, C has dimension e(G) — n(G) + 1 and B has dimension n(G) — 1. 
(Hint: Show that the cycles created by adding one edge to a particular spanning tree 
form a basis for the cycle space. Show that n(G) — 1-bonds that isolate single vertices 
form a basis for the bond space, or use orthogonality.) 
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8.6.3. Recall that the closed neighborhood of a vertex v is N(v) U {v}. 

a) Let S be a set of vertices in a simple graph G whose neighborhoods are identical. 
Prove that some eigenvalue of G has multiplicity at least |5| — 1. What is it? 

b) Let S be a set of vertices in a simple graph G whose closed neighborhoods are 
identical. Prove that some eigenvalue of G has multiplicity at least |5| — 1. What is it? 


8.6.4. Let 0, be the number of subgraphs of a graph G that are k-cycles. Let L; = )> ri 
and D, = )- ds be the sums of the kth powers of the eigenvalues and the vertex degrees. 
Obtain formulas for 03 and o, in terms of {L,} and {D,}. 


8.6.5. Deletion formulas for the characteristic polynomial. For clarity in this problem, 
we write $(G; 4) as dc. Let v [xy] be an arbitrary vertex [edge] of G, and let Z(v) [Z@xy)] 
be the collection of cycles containing v [xy]. Prove that the characteristic polynomial 
satisfies the following recurrences. 

8) bg = Adc—v — Dineniny PG-v-u — 2 Dicezyy $6-ViC)- 

b) oc = $G-xy _ $G-x-y = 2 as $6-vic)- 
(Hint: Induction or Sach’s formula can be used. Also, the edge-deletion formula can be 
proved from the vertex-deletion formula. Comment: When G is a forest and v is a leaf 
with neighbor u, the formulas reduce to ¢g = Adc-» — $Gg-v-u and dg = $G-xy — dG-x-y-) 


8.6.6. Characteristic polynomial for paths and cycles. 

a) Use Exercise 8.6.5 to find recurrences for $(P,; A) and for (C,; A). 

b) Without solving the recurrence, praye that {2 cos(27j/n): 0 < j7 <n—1)} are the 
eigenvalues of C,,. 

c) Given Spec (C,,), compute Spec G, Shere G is the graph obtained from C, by 
adding edges joining vertices at distance 2 i An Cin 


8.6.7. For a tree, prove that the coefficient of 4"-** in the characteristic polynomial is 
(—1)* u,.(G), where ;.(G) is the number of matchings of size k. Use this to construct a 
pair of nonisomorphic “co-spectral” 8-vertex trees; both have characteristic polynomial 
48 — 726 +914. (Comment: As n > oo, almost no trees are uniquely determined by their 
spectra.) (Schwenk [1973]) 


8.6.8. (+) Let T be a tree. Prove that a(T) is the number of nonnegative eigenvalues of 
T. (Hint: See Theorem 8.6.20.) (Cvetkovi¢é—Doob—Sachs [1979, p233]) 


8.6.9. Let 4 be an eigenvalue of a graph G with n vertices and m edges. Prove that 
|A| < /2m(n —1)/n. 


8.6.10. Let A1,...,A,, and 441,..., u, be the eigenvalues of G and H, respectively. Show 
that the mn eigenvalues of GOH are {i; + u;}. Use this to derive the spectrum of the 
k-cube. (Hint: Given an eigenvector of A(G) associated with A; and an eigenvector of- 
A(#) associated with ;, construct an eigenvector for A(GO H) associated with A; + 1;.) 


8.6.11. Compute the spectrum of the complete p-partite graph Kn,..m. (Hint: Use the 
expression A(G) = J — I — A(G) for the adjacency matrix of the complement.) 


8.6.12. Given ¢(G; x) = x8 — 24x® — 64x5 — 4854, determine G. 
8.6.13. (!) Prove that G is bipartite if G is connected and Amax(G) = —Amin(G). 


8.6.14. (!) Given a graph G, let R(G) be the matrix whose i, jth entry is dg(v;, v;). Prove 
that the squashed-cube dimension of a graph (Definition 8.4.12) is at least the maximum 
of the number of positive eigenvalues and the number of negative eigenvalues of R(G). 
Conclude that the squashed cube dimension of K,, is n — 1. (Hint: Rewrite the quadratic 
form x’ Rx as asum of squares of linear functions, and apply Sylvester’s Law of Inertia.) 
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8.6.15. (!) The Laplacian matrix Q of a graph G is D — A, where D is the diagonal 
matrix of degrees and A is the adjacency matrix. The Laplacian spectrum is the list 
of eigenvalues of Q. 

a) Prove that the smallest eigenvalue of Q is 0. 

b) Prove that if G is connected, then eigenvalue 0 has multiplicity 1. 

c) Prove that if G is k-regular, then k — 4 is a Laplacian eigenvalue if and only if A 
is an ordinary eigenvalue of G, with the same multiplicity. 


8.6.16. Given a real symmetric matrix partitioned as M = (; ot R |) with P, R square, a 
lemma in linear algebra yields Awax(M) + Amin(M) < Amax(P) + Amax(R). 

a) Let A be a real symmetric matrix partitioned into t? submatrices A,,; such that 
the diagonal submatrices A;; are square. Prove that 


Amax (A) G= l)Amin(A) < < Ae Amax(Aii). 


b) Prove that x(G) > 1+ Amax(G)/(—Amin(G)) when G is nontrivial. (Wilf) 
c) Use the Four Color Theorem to prove that 4;(G) + 34,(G) < 0 for planar graphs. 


8.6.17. (!) Use Theorem 8.6.28 to count the spanning trees in K,,,,. (Comment: See 
Exercise 2.2.11.) 


8.6.18. (+) Given a matrix A, let b;,; equal (—1)'*/ times the matrix obtained by deleting 
row i and column j of A. Let Adj A be the matrix whose entry in position i, j is b;,;. The 
definition of the determinant by expansion along rows of A yields A(Adj A) = (det A)/. 
Use this formula to prove that if the sum of the columns of A is the vector 0, then 3; ; 
is independent of j. (Comment: With the next exercise, this completes the proof of the 
Matrix Tree Theorem (Theorem 2.2.12).) 


8.6.19. (+) Let C = AB, where A and B are nxmandm xn matrices. Given S C [ml], 
let As be the n x n matrix whose columns are the columns of A indexed by S, and let Bs 
be the n x n matrix whose rows are the rows’of B indexed by S. Prove the Binet-Cauchy 
Formula: detC = > 5 det As det Bs, where’ the summation extends over all n-element 


subsets of [m]. (Hint: Consider the matrix equation ( a A a ) = ee rae. 
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8.6.20. A matrix is totally unimodular if every square submatrix has determinant in 
{0,1,—1}. Prove that the incidence matrix of a simple graph is totally unimodular if 
and only if the graph is bipartite. (Reminder: The incidence matrix of a simple graph 
has two +1’s in each column). 


8.6.21, (—) Let G be an (n, k, c)-magnifier with vertices v,,..., v,. Let H be the bipartite 
graph with partite sets X = {x,...,x,} and Y = {y,..., y,} such that x;y; € E(H#) if 
and only ifi = j or v;v; € E(G). Prove that H is an (n,k + 1, c)-expander. 


8.6.22. Existence of expanders of linear size. 

a) Let X be a random variable giving the size of the union of k s-subsets of [n] chosen 
at random from ae Prove that P(X <1) < (7) (l/n)*. 

b) (+) For af < 1, prove that there is a constant k such that, when n is sufficiently 
large, there exists a subgraph of K,,,, with maximum degree at most k such that |N(S)| > 
B |S| whenever |S| < an. (Hint: Generate bipartite subgraphs of K,,,, by taking the union 
of k random perfect matchings.) 

c) Conclude the existence of k such that n, k, c-expanders exist for all sufficiently 
large n. An (n,a, 8, d)-expander is a bipartite graph G C Ka, with |A| = |B| = 2, 
A(G) < d, and |N(S)| > 6|S| whenever |S| < an. 


8.6.23. Let G be a triangle-free graph on n vertices in which every pair of nonadjacent 
vertices has exactly two common neighbors. Prove that G is regular and that n = 1+ 
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ge where k is the degree of the vertices in G. Prove that G is strongly regular. What 
constraints on k are implied by the integrality conditions? Construct examples for all 
k € {1, 2,5}. A realization for k = 10 is known using combinatorial designs.) 


8.6.24. (+) Prove that the Petersen graph is strongly regular, and determine its spec- 
trum (the spectrum is easy with properties of strongly regular graphs and not hard 
without them). Apply the spectrum to show that edges of the complete graph Ko cannot 
be partitioned into three disjoint copies of the Petersen graph. (Hint: Use the spectrum 
to prove that two copies of the Petersen matrix have a common eigenvector other than 
the constant vector.) (Schwenk [1983]) 


8.6.25. Let F = GOH, where G and H are simple graphs. Prove that, if every two non- 
adjacent vertices in F have exactly two common neighbors, then G and H are complete 


graphs. 


8.6.26. The subconstituents of a graph are the induced subgraphs of the form G[U], 
where v € V(G) and U = N(v) or U = N[v]. Vince [1989] defined G to be superregu- 
lar if G has no vertices or if G is regular and every subconstituent of G is superregular. 
Let S be the class consisting of {aK,: a, b > 0} (disjoint unions of isomorphic cliques), 
{Ky O Ky: m > 0}, Cs, and the complements of these graphs. 

a) Prove that every graph in S is superregular and that every disconnected super- 
regular graph is in S. (Comment: In fact, every superregular graph is in S, but the 
complete inductive proof of this requires several pages (Maddox [1996], West [1996]) 

b) Prove that every superregular graph is strongly regular. 


8.6.27. (+) Automorphisms and eigenvalues. 

a) Prove that o is an automorphism of G if and only if the permutation matrix cor- 
responding to o commutes with the adjacency matrix of G; that is, PA = AP. 

b) Let x be an eigenvector of G for an eigenvalue of multiplicity 1, and let P be the 
permutation matrix for an automorphism of G. Prove that Px = +x. 

c) Conclude that when every eigenvalue of G has multiplicity 1, every automor- 
phism of G is an involution, meaning that repeating it yields the identity. (Mowshowitz 
[1969], Petersdorf—Sachs [1969]) 


8.6.28. (+) Light bulbs ),...,/, are controlled by switches s,...,5,. The ith switch 
changes the on/off status of the ith light and possibly others, buts; changes the status 
of /; if and only if s; changes the status of /;. Initially all the lights are off. Prove that 
it is possible to turn all the lights on. (Peled [1992]) (Hint: This uses vector spaces, not 
eigenvalues.) 


Appendix A 
Mathematical Background 


This appendix summarizes aspects of language and mathematics that are 
not directly part of graph theory but provide useful background for learning 
graph theory. Where appropriate, we mention examples in the context of 
graphs, so it is best to read this appendix in conjunction with Chapter 1. ‘This 
presentation is modeled on material in the first half of Mathematical Thinking, 
by John P. D’Angelo and Douglas B. West (Prentice—Hall, second edition, 2000). 


SETS 


Our most primitive mathematical notion is that of a set. It is so funda- 
mental that we cannot define it in terms of simpler concepts. We think of a set 
as a collection of distinct objects with a precise description that provides a way 
of deciding (in principle) whether a given object is in it. 


0 


A.1. Definition. The objects in a set are its elements or members. When x 
is an element of A, we write x € A and say “x belongs to A”. When x is 

not in A, we write x ¢ A. If every element of a set B belongs to A, then B 

is a subset of A, and A contains B ;we write B CAorAD B. 

For example, we may speak of the set A of graphs with n vertices. When 
we impose an additional restriction, such as requiring that the graphs also be 
connected, we obtain a subset of A. 

When we list the elements of a set explicitly, we put braces around the list; 
“A = {-—1, 1}” specifies the set A consisting of the elements —1 and 1. Writing 
the elements in a different order does not change a set. We write x, y € S to 
mean that both x and y are elements of S. 


A.2. Example. We use the characters N, Z, Q, and R to name the sets of natu- 
ral numbers, integers, rational numbers, and real numbers, respectively. 
Each set in this list is contained in the next, so we writeNC ZCQCR. 
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We treat these sets and their elements as familiar objects. By conven- 
tion, 0 is not a natural number, so N = {1,2,3,...}. The set of integers is 
Z = {...,—-2,—1,0,1,2,...}. The set Q of rational numbers is the set of real 
numbers expressible as a/b witha,b € Zand b £0. 

We also take as familiar the elementary arithmetic properties of these 
number systems. These include the rules that permit algebraic manipulation 
of expressions, equalities, and inequalities. They also include elementary prop- 
erties about divisibility of integers. | 


A.3. Definition. Sets A and B are equal, written A = B, if they have the same 
elements. The empty set, written G, is the unique set with no elements. 
A proper subset of a set A is a subset that is not A itself. 


The empty set is a subset of every set, and every set is a subset of itself. The 
definition of subgraph (Definition 1.1.16) is similar. Every graph is a subgraph 
of itself, but something must be discarded to obtain a proper subgraph. 

“Solving a mathematical problem” often means describing a given set more 
simply. We must show that the set of objects satisfying the new description is 
equal to the given set. 


A.4. Remark. Equality of sets. To prove that A = B, we prove that every 
element of A is in B and that every element of B is in A; in other words, A C B 
and B CA. It also suffices to turn the description of one set into the description 
of the other by operations that do not change membership. 

This book proves many characterization theorems for classes of graphs. 
Such a theorem states that two sets are the same (example: the set of bipartite 
graphs is equal to the set of graphs without odd cycles—Theorem 1.2.18). 

Often, a mathematical model defines a set S of solutions; these are the 
objects that satisfy the conditions of the problem. We want to list or describe 
the solutions explicitly; this specifies a set T. The problem is to show that 
S = T. Proving S C T means showing that every solution belongs to 7. Proving 
T C S means showing that every member of T is a solution. | 


A.5. Remark. Specifying a set. Given a set A, we may want to specify a subset 
S consisting of the elements of A that satisfy a given condition. To do so, we 
write “S = {x € A: condition(x)}”. We read this as “S is the set of elements x in A 
such that x satisfies ‘condition’ ”. For example, the expression {n € N : n? < 25) 
is another way to name the set {1, 2, 3, 4, 5}. 

In this format, the set A is the universe for x; we can drop this part of the 
notation when the context makes it clear. For example, {n?: n € N} is the set of 
positive integers squares. a 


Many special sets have common names and/or notation. 


A.6. Definition. When a, b € Z, we write {a,..., b} for {i € Z:a <i < b}. When 
n € N, we write [n] for {1,..., 7}; also [0] = @. The set of even numbers 
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is {2k: k € Z}. The set of odd numbers is {2k + 1: k € Z}. The parity of 
an integer states whether it is even or odd. 


Note that 0 is an even number. We say “even” and “odd” for numbers only 
when discussing integers. Every integer is even or odd; none is both. 


A.7. Definition. A partition of a set A is a list A1,..., A; of subsets of A such 
that each element of A appears in exactly one subset in the list. 


The set of even numbers and the set of odd numbers partition Z. In a par- 
tition of A into A;,..., Ax, the sets A;,..., A; in the list are called “blocks” or 
“classes” or “parts” or “partite sets”. The use of “blocks” is common in combi- 
natorics, but graph theory has another definition for the word “block”, so we 
usually use “classes” or “sets”. “Partite sets” is used only for the sets in a 
partition of the vertex set of a graph into independent sets. 


A.8. Remark. Conventions about universes. When we write “[n]”, it is under- 
stood that n is a nonnegative integer. When we speak of n as the number of 
vertices in a graph, by context we know that n is a natural number. When we 
say only that a number is positive without specifying the number system con- 
taining it, we mean that it is a positive real number. Thus, “consider x > 0” 
means “let x be a positive real number”, but in “For n > 2, let G be a n-vertex 
graph” our convention is thatn EN. — a 


n.me/n 


A.9. Definition. A set A is finite if there is a one-to-one correspondence be- 
tween A and [n] for some n € N U {0}. This n is the size of A, written | Al]. 


Another elementary property of number systems is that a set A cannot be in 
one-to-one correspondence with both [m] and [n] when m # n. Thus the size of a 
finite set is a well-defined integer. Counting a set means determining its size. 


A.10. Remark. “If” in definitions. It is a common convention in definining 
mathematical properties to say that an object has a certain property if it sat- 
isfies a certain condition. Subsequently, the condition can be substituted for 
the property and vice versa, so the “if” really means “if and only if”. This con- 
ventional usage in definitions reflects the notion that the concept being defined 
does not exist until the definition is complete. a 


There are several natural ways to obtain new sets from old sets. 


A.11. Definition. Let A and B be sets. Their union A U B consists of all 
elements in A or in B (or both). Their intersection A/ B consists of all 
elements in both A and B. Their difference A — B consists of the elements 
of A that are not in B. Their symmetric difference AaB is the set of 
elements belonging to exactly one of A and B. 

Two sets are disjoint if their intersection is the empty set 0. Ifa set A 
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is contained in some universe U under discussion, then the complement 
A of A is the set of elements of U not in A. 


When we speak of taking the “complement” of a simple graph, we are keep- 
ing the vertex set unchanged and taking the complement of the edge set (viewed 
as pairs of vertices) within the universe of vertex pairs. Other times we at . 
the complement S of a set of vertices S in G; in this case we mean S = V(G) — 


A.12. Remark. In a Venn diagram, an outer box represents the universe 
under consideration, and regions within the box correspond to sets. Non- 
overlapping regions correspond to disjoint sets. The four regions in the Venn 
diagram for two sets A and B represent AN B, (AUB), A— B, and B — A. Note 
that AaB = (A — B) U(B-—A), 

Since A—B consists of the elementsin A and not in B, we have A—B = ANB. 
Similarly, the diagram suggests that B is the union of A — B and (A U B), which 
are disjoint. It also suggests that the symmetric difference AAB is obtained 
from the union by deleting the intersection. | 


A.13. Remark. When A and B are sets, AAB = (AU B) — (ANB). The union 
starts with all elements in at least one of A and B; we delete those in both. 
When A and B are finite sets, |AU B|+|AM B| = |A|+]|B|. Each element of 
the intersection is counted twice on both sides, each element of the symmetric 
difference is counted once on both sides, and no other elements are counted. & 


A.14, Definition. A list with entries in A consists of elements of A in a spec- 
ified order, with repetition allowed. A k-tuple is a list with k entries. We 
write A* for the set of k-tuples with entries in A. When A = {0, 1}, A* is 
the set of binary k-tuples. 

An ordered pair (x, y) is a list with two entries. The cartesian 
product of sets S and 7, written S x T, is the set {(x, y):x € S,y € T}. 


Note that A? = Ax A and A* = {(xj,..., x): x; € A}. We read “x,” as “x sub 
i”, When S = T = Z, the cartesian product S x T is the integer lattice, the 
set of points in the plane with integer coordinates. 
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QUANTIFIERS AND PROOFS 


Roughly speaking, a mathematical statement is a statement that can be 
determined to be true or false. This requires correct mathematical grammar, 
and it requires that variables be “quantified”. 

For example, the sentence x? — 4 = 0 cannot be determined to be true 
or false because we do not know the value of x. It becomes a mathematical 
statement if we precede it with “When x = 3,” or “For x € {2, —2},” or “For some 
integer x,”. 

If a sentence P(x) becomes a mathematical statement whenever the vari- 
able x takes a value in the set S, then the two sentences below are mathematical 
statements. 


“For all x in S, the sentence P(x) is true.” 
“For some x in S, the sentence P(x) is true.” 


A.15. Definition. In the statement “For all x in S, P(x) is true”, the variable x 
is universally quantified. We write this as (Vx € S) P(x) and say that V 
is a universal quantifier. In “For some x in S, P(x) is true”, the variable 
x is existentially quantified. We write this as (Ax € S)P(x) and say that 
J is an existential quantifier. The set of allowed values for a variable is 
its universe. , 


) 


h_b« 


A.16. Remark. English words that express quantification. Typically, “every” 
and “for all” represent universal quantifiers, while “some” and “there is” rep- 
resent existential quantifiers. We can also express universal quantification by 
referring to an arbitrary element of the universe, as in “Let x be an integer’, 
or “A student failing the exam will fail the course”. Below we list common 
indicators of quantification. 


Universal (¥) 
for [all], for every 


Existential (3) (helpers) 
for some 


(helpers) 


if then there exists such ‘that 
whenever, for, given at least one for which 
every, any satisfies some satisfies 
a, arbitrary must, is has a such that 
let be 


The “helpers” may be absent. Consider “The square of a real number is non- 
negative”. This means x? > 0 for every x € R; it is not a statement about one 
real number and cannot be verified by an example. When we write “A bipartite 
graph has no odd cycle”, we mean “in every bipartite graph there is no odd cy- 
cle”. When we write “Let G be a bipartite graph”, we mean that every bipartite 
graph is under consideration. When we take an “arbitrary” vertex in a graph, 
we are considering each one individually. When we discus an “arbitrary” pair 
of vertices in a graph, we are considering each pair, one at a time. 

The difference between “for every G” and “for every graph G” is that the 
latter specifies the universe for the universally quantified variable G. | 
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Existential quantifiers state lower bounds; “there is a” and “there are two” 
mean “at least one” and “at least two”. Phrases like “there is a unique” and 
“there are exactly two” indicate equality. Sometimes equality is clear from 
context, but it does not hurt to make it explicit when it is intended. 

A statement may have more than one quantifier. Consider the sentence 
“There are triangle-free graphs with arbitrarily large chromatic number”. 
Phrased using explicit quantifiers, this means “For every n e€ N, there ex- 
ists a triangle-free graph with chromatic number at least n”. The expression 
“arbitrarily large” often conveys an implicit universal quantifier in this way. 

In contrast, the expression “sufficiently large” imposes an implicit existen- 
tial quantifier. The statement “2” > n!9° when n is sufficiently large” means 
“There exists N € N such that for all n > N, the inequality 2” > n'° holds”. 


A.17. Remark. The meaning of a statement with more than one quantifier 
depends on their order. Compare these two sentences: 


“For every graph G, there exists m € N such that every v ¢ V(G) has degree at most m” 
“There exists m € N such that for every graph G, every v € V(G) has degree at most m” 


The first statement is true; the second is false. Every (finite) graph has a max- 
imum degree, but there is no maximum over all graphs. We write the two 
sentences in logical notation as E 


(¥G)(am € N)(Wu E V(G))(dg(v) < m). 
(am € N)(WG) (Wu € V(G))(dg(v) < m). 


boc 


In English, quantifiers often appear at the ends of sentence to enhance 
readability, as in “I feel happy every time I learn something new.” In sentences 
with abstract concepts and more than one quantifier, we adopt conventions 
about order to avoid confusion. Quantifiers apply in the order in which they are 
stated. In particular, a variable is chosen in terms of the preceding variables. 

For example, in (VG)(Am € N)P(G, m), we have the freedom to choose m 
after knowing what G is. In (Am € N)(VG)P(G, m), we must choose a single m 
that works for all G. | 


A.18. Remark. Negation of quantified statements. The logical symbol for nega- 
tion is —. If it is false that all x € S make P(x) true, then there must be some 
x € S such that P(x) is false. Similarly, negating an existentially quantified 
statement yields a universally quantified negation. In notation, 


=[(v¥x € S)P(x)] has the same meaning as (Ax € S)(—P(x)). 
—=[(dax € S$) P(x)] has the same meaning as (Vx € S)(—P(x)). 


The universe of quantification does not change when the statement is negated. 
For example, the false statement. in Remark A.17 was 


(dm € N)(VG)(Vu € V(G))(dg(v) < m). 


Its negation is the same as (Vm € N)(AG)[—((Wu € V(G))(dg(v) < m))], which 
we further simplify to (¥m € N)(AG)(au € V(G))(dg(v) > m). This statement is 
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“for every natural number m, there is some graph having a vertex with degree 
greater than m”, which is true. a 


Logical connectives permit us to build compound statements. 
A.19. Definition. Logical connectives. In the following table, we define the 


operations named in the first column by the truth values specified in the 
last column. 


Name Symbol Meaning Condition for truth 
Negation a=P not P P false 

Conjunction PAQ PandQ both true 

Disjunction PvVQ Por@ at least one true 
Biconditional Ps@Q Pif&onlyif Q same truth value 
Conditional P=»Q _ Pimplies Q Q true whenever P true 


A.20. Remark. Conjunction and disjunction are quantifiers over the truth of 
their component statements. A conjunction (“and”) is true precisely when all of 
its component statements are true. A disjunction (“or”) is true precisely when 
there exists a true statement among its components. Our understanding of 
negation thus yields logical equivalence between —(P A Q) and (-P) v (=Q) 
and between —(P Vv Q) and (—P) A (-Q). | 


Fs 


1e/ 


A.21. Definition. In the conditional statement P= Q, we call P the hypoth- 
esis and Q the conclusion. The statement Q = P is the converse of 
P=>@Q. . 


A.22. Remark. Conditionals. Conditional statements are the only type in Def- 
inition A.19 whose meaning changes when P and Q are interchanged. There 
is no general implication between P > Q and its converse Q => P. Consider 
these three statements about a graph G: P is “G is a path’, Q is “G is bipar- 
tite”, and R is “G has no odd cycles”. Here P > Q is true but Q = P is false. 
On the other hand, both Q > R and R => Qaretrue. © 

Note that here G is a variable. We have dropped G from the notation for 
the statements because the context is clear. The precise meaning of P > Q 
using G is (VG)(P(G) > Q(G)). 

A conditional statement is false when and only when the hypothesis is true 
and the conclusion is false. Thus the meaning of P > Q is (—P) v Q; the two 
are logically equivalent. Every conditional statement with a false hypothesis is 
true, regardless of whether the conclusion is true. The meaning of ~(P > Q) 
is PA(-Q). 

Below we list ways to say P > Q in English. | 


If P (is true), then Q (is true). P is true only if Q is true. 
Q is true whenever P is true. P is a sufficient condition for Q. 
Q is true if P is true. Q is a necessary condition for P. 
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The business of mathematics is proving implications. Note that univer- 
sally quantified statements can be interpreted as conditional statements. The 
statement “(VG € G)(P(G))” has the same meaning as “If G € G, then P(G)” 
(consider the two statements when G is the family of bipartite graphs and P(G) 
is the assertion that G has no odd cycles). 

The basic proof methods come from the meaning of conditional statements. 


A.23. Remark. Proving implications. The direct method of proving P > Q 
is to assume that P is true and then to apply mathematical reasoning to deduce 
that Q is true. When P is “x € A” and Q is “Q(x)”, the direct method considers 
an arbitrary x € A and deduces Q(x). There is no “proof by example”. The proof 
must apply to every member of A as a possible instance of x. 

The contrapositive of P = Q is -Q = --P. Each of these statements 
fails only when P is true and Q is false. Thus they are equivalent; we can prove 
P => Q by proving -Q = —P. This is the contrapositive method. 

We have observed that (P = Q) © —[P A (-—@Q)]. Hence we can prove 
P = Q by proving that P and —Q cannot both be true. We do this by obtain- 
ing a contradiction after assuming both P and -—Q. This is the method of 
contradiction. 

The two latter methods are indirect proof. When the direct method for 
P = Q doesn’t seem to work, we say “Well, suppose not”. At that point we are 
starting from the assumption ~Q. We:need not know in advance whether we 
are seeking to derive —P (contrapositive method) or seeking to use P and -Q 
to obtain a contradiction. | 


gram.m 


Examples of each of these methods appear in the text. Indirect proof is 
promising when the negation of the conclusion provides useful information. 
This approach may be easier than finding a direct proof, because both the hy- 
pothesis and the negation of the conclusion can be used. If the contradiction 
we obtain is the impossibility of our original assumption —Q, then usually we 
can rewrite the proof in simpler language as a direct proof. If instead we obtain 
—P, then we have proved the contrapositive. 


A.24. Remark. Biconditional statements. The biconditional statement “P = 
Q” has the same meaning as “(P => Q)A(Q => P)”. We read it as “P if and 
only if Q”, where “QO = P” is “P if Q”, and “P > Q” is “P only if Q”. 

Although sometimes we can prove a biconditional statement by a chain of 
equivalences, usually we prove a conditional statement and its converse; the 
latter is also a conditional statement. For each we have the three fundamental 
methods above. To prove P & Q, we must prove one statement in each column 
in the table below. The lines are the direct method, the contrapositive method, 
and the method of contradiction, respectively. Proving two statements in the 
same column would amount to proving the same statement twice. | 


P>Q Q>P 


=Q>-P af => 30 
—(P A7Q) -(Q A =P) 
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Students sometimes wonder about the precise meanings of words like “the- 
orem”, “lemma”, and “corollary” that are used to designate mathematical re- 
sults. In Greek, lemma means “premise” and theorema means “thesis to be 
proved”. Thus a theorem is a major result requiring some effort. A lemma is 
a lesser statement, usually proved in order to help prove other statements. A 
proposition is something “proposed” to be proved; typically this takes less ef- 
fort than a theorem. The word corollary comes from Latin, as a modification of 
a word meaning “gift”; a corollary follows easily from a theorem or proposition, 
without much additional work. 


INDUCTION AND RECURRENCE 


Many statements having a natural number as a variable can be proved 
using the technique of induction. In Theorem 1.2.1, we describe the strong ver- 
sion of induction. Here we review the ordinary version that most students learn 
when they first encounter induction. It involves the Well Ordering Property for 
the natural numbers, which states that every nonempty subset of N has a least 
element. We take this as an axiom, as part of our intuitive understanding of 
what N is. Although we then state the.Principle of Induction as a Theorem, in 
reality it is equivalent to the Well Ordgring Property for N. 


A.25. Theorem. (Principle of Induction) For each natural number n, let P(n) 
be a mathematical statement. If properties (a) and (b) below hold, then for 
each n € N the statement P(n) is true. 

a) P(1) is true. 
b) For k €N, if P(k) is true, then Pk + 1) is true. 


Proof: If P(n) is not true for all n, then the set of natural numbers where it fails 
is nonempty. By the Well Ordering Property, there is a least natural number 
in this set. By (a), this number cannot be 1. By (b), it cannot be bigger than 1. 
The contradiction implies that P(n) is true for all n. | 


When applying the method of induction, we prove statement (a) in Theorem 
A.25 as the basis step and statement (b) as the induction step. Statement (b) 
is a conditional statement, and its hypothesis (“P(k)” is true) is the induction 
hypothesis. We present one example in rather formal language. 


A.26. Proposition. If S is a set of n lines iri the plane such that every two have 
exactly one common point and no three have a common point, then S cuts 
the plane into 1+ n(n + 1)/2 regions. 


Proof: We use induction on n to prove the claim for all n € N. Let P(n) be the 
statement that the claim holds for all such sets of n lines. 

Basis step (P(1)). With one line the number of regions is 2, which equals 
1+1(14+1)/2. 

Induction step (P(k) => P(k+1)). The statement P(k) is the induction 
hypothesis. Let § be a set of k + 1 lines meeting the conditions. Select a line L 
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in S (the dashed line in the figure), and let S’ be the set of & lines obtained by 
deleting L from S. 

Since S’ meets the conditions, the induction hypothesis states that S’ cuts 
the plane into 1+ k(k + 1)/2 regions. When we replace L, some regions are cut. 
The increase in the number of regions is the number of regions that L cuts. 
It moves from one of these regions to another each time it crosses a line in S’. 
Since L crosses each line in S’ once, the lines in S’ cut L into k + 1 pieces. Each 
piece corresponds to a region that L cuts. 

Thus the number of regions formed by S is k + 1 more than the number of 
regions formed by S’. The number of regions formed by S is 


1+k(k+1)/24+ (k4+1) =14+(k4+1)(k 4 2)/2. 


We have proved that P(k) implies P(k + 1). 
By the principle of induction, the claim holds for every n € N. | 


a 


A.27. Remark. The discussion of Proposition A.26 suggests several comments 
about proof by induction. Note first that we could also have used n = 0 as the 
basis step to prove the statement for all nonnegative n. 

It is not immediately obvious from the statement of the problem that the 
number of regions is the same for all sets of n lines, but this follows because we 
proved a formula for this number that depends only on n. 

In the proof of the induction step, we began with L, an instance of the 
larger-sized problem. This approach ensures that we have considered all such 
instances; we return to this point shortly. 

We proved P(k + 1) from P(k) as suggested by statement (b) of Theorem 
A.25. In most examples in this book, we use a different phrasing that is more 
consistent with strong induction as introduced in Section 1.2. To prove P(n) 
for all n € N, in this example we would write “Basis step: n = 1. .” and 
then “Induction step: n >1....”. In the proof of the induction step, we would 
consider an arbitrary set S of n lines and apply the induction hypothesis to the 
set S’ obtained by deleting one line L. 

The content of the proof is the same in both phrasings. The phrasing that 
we have just described emphasizes the item about which the claim is proved. 
The basis step directly verifies the claim for the smallest value of the induction 
parameter. When the parameter has a larger value, the claim about the item 
is proved using the hypothesis that it holds for an earlier item; this is the 
induction step. Invoking it repenteny? yields the claim for each subsequent 
value of the parameter. | 
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When learning to use induction in graph theory, many students have trou- 
ble with two particular aspects. One is when the statement P(n) being proved 
by induction is itself a conditional statement A(n) = B(n). The induction hy- 
pothesis is the statement A(n — 1) => B(n — 1). A template for the induction 
step in this situation is presented in Remark 1.3.25, and there are examples of 
this throughout Chapter 1. 

The other pitfall we call the “induction trap”, discussed at length in Exam- 
ple 1.3.26. Here we provide another example, using the language of proving 
P(n + 1) from P(n) that sometimes leads students into the trap. 


A.28. Example. The Handshake Problem. Let a handshake party of order 
n (henceforth “n-party”) be a party with n married couples where no spouses 
shake hands with each other and the 2n — 1 people other than the host shake 
hands with different numbers of people. We use induction on 7 to prove that in 
every n-party, the hostess shakes with exactly n — 1 people. 

We model the party using a simple graph in which the vertices are the 
people at the party and the edges are the pairs who shake hands. The degree 
of a vertex is its number of handshaking partners. If no one shakes with his 
or her spouse, then each degree is between 0 and 2n — 2. The condition that 
the 2n — 1 numbers other than the host’s are distinct implies that the degrees 
are 0 through 2n — 2. The figure below shows for n ¢ {1, 2, 3} the graph that is 
forced; each circled pair of vertices indicates a married couple, with host and 
hostess rightmost in each graph. 
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Basis step: If n = 1, then the hostess shakes with 0 (which equals n — 1), 
because the host and hostess don’t shake. 

Induction step (INVALID): The induction hypothesis is that the claim 
holds for n-parties. Consider such a party. By the induction hypothesis, the 
degree of the hostess is n — 1. By our earlier discussion, the degrees of ver- 
tices other than the host are 0,..., 2n — 2. We form an (n + 1)-party by adding 
one more couple. Let one member of the new couple shake with everyone in 
the first n couples; the other shakes with no one. This increases the degree of 
each of the earlier vertices by 1, so those degrees other than the host are now 
1,...,2n — 1, and the new couple have degrees 0 and 2n. Hence the larger con- 
figuration is an (n + 1)-party. The degree of the hostess has increased by 1, so 
it is n. 

Induction step (VALID): The induction hypothesis is that the claim holds 
for n-parties. Consider an (n + 1)-party. By our earlier discussion, the degrees 
other than the host are 0,...,2n. Let p; denote the person of degree i among 
these. Since pg, shakes with all but one person, the person pp who shakes 
with no one must be the only person missed by p2,. Hence po is the spouse of 
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pon. Furthermore, this married couple S = {po, p2,} is not the host and hostess, 
since the host is not in {ppo,..., Pon}. 

Everyone not in S shakes with exactly one person in S, namely p2,. If we 
delete S to obtain a smaller party, then we have n couples remaining (including 
the host and hostess), no person shakes with a spouse, and each person shakes 
with one fewer person than in the full party. Hence in the smaller party the 
people other than the host shake hands with different numbers of people. 

By deleting the set S, we thus obtain an n-party (deleting the leftmost 
couple in the picture for n = 3 yields the picture for n = 2). Applying the 
induction hypothesis to this n-party tells us that, outside of the couple S, the 
hostess shakes with n -- 1 people. Since she also shakes with po, € S, in the full 
(n + 1)-party she shakes with n people. | 


The first argument in Example A.28 falls into the induction trap, because 
it does not consider all possible (n + 1)-parties. It considers only those obtained 
by adding a couple to an n-party in a certain way, without proving that every 
(n + 1)-party is obtained in this way. 

Starting with an arbitrary (n + 1)-party forces us to prove that every (n + 
1)-party arises in this way in order to obtain a configuration where we can 
apply the induction hypothesis. We cannot discard just any married couple to 
obtain the smaller party. We must find a couple S such that everyone outside 
S shakes with exactly one person in S. “Only then will the smaller party satisfy 
the hypotheses needed to be an n-party. 

The need to show that our smaller object satisfies the conditions in the 
induction hypothesis replaces the need to prove that all objects of the larger 
size were generated by growing from an object of the smaller size. 


Sometimes the proof of the induction step uses more than one earlier in- 
stance. If we always use both P(n — 2) and-P(n — 1) to prove P(n), then we 
must verify both P(1) and P(2) to get started. The proof of the induction step 
is not valid for n = 2, since there is no P(0) to use. ; 


A.29. Example. Let a), a2,... be defined by a; = 2, az = 8, anda, = 4(a,_1 — 
a,-2) for n > 3. We seek a formula for a, in terms of n. 

We may try to guess a formula that fits the data. The definition yields 
a3 = 24, a4 = 64, and as = 160. All these satisfy a, = n2”. Having guessed this 
as a possible formula for a,, we can try to use induction to prove it. 

When n = 1, we have a; = 2 = 1-2!. Whenn = 2, we have ag = 8 = 2-2”. 
In both cases, the formula is correct. 

In the induction step, we prove that the desired formula is correct for n > 3. 
We use the hypothesis that the formula is correct for the preceding instances 
n—1andn -— 2. This allows us to compute a, using its expression in terms of 
earlier values: 


Gy = 4(dy—1 — Qy_2) = 4[(n — 1)2"71! — (n — 2)2"-?] = (2n — 2)2" — (n — 2)2" = 2". 


The validity of the formula for a, follows from its validity for a,—1 and a,_9, 
which completes the proof. | 
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In this proof, we must verify the formula for n = 1 and n = 2 in the basis 
step; the proof of the induction step is not valid when n = 2. Example A.29 
specifies aj. a2.... by a recurrence relation. The general term a, is specified 
using earlier terms. Similarly, the proof of Proposition A.26 yields a recurrence 
for the number r,, of regions formed by n lines; r, = r,_1 +1, with r; = 2. 

If the recurrence relation uses & earlier terms to compute a,,, then we must 
provide & initial values in order to specify the terms exactly; this is a recurrence 
of order k. Statements proved by induction about recurrences of order k typi- 
cally require verification of k instances in the basis step. Standard techniques 
from enumerative combinatorics yield solutions to many recurrence relations 
without guessing formulas or directly using induction. 

We also sometimes use recursive computation in graph theory. We may 
have a value for each graph instead of just one for each “size” as in a sequence. 
If we can express the value for a graph G as a formula in terms of graphs with 
fewer edges (and specify the values for graphs with no edges), then again we 
have a recurrence. We use this technique to count spanning trees (Section 2.2) 
and proper colorings (Section 5.3). 


FUNCTIONS 


A function transforms elements of-one set into elements of another. 
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A.30. Definition. A function f from a set A to a set B assigns to each a € A 
a single element f(a) in B, called the i image of a under f. For a function 
f from A to B (written f: A > B), the set A is the domain and the set B 
is the target. The image of a function f with domain A is { f(a): a € A}. 


We take many elementary functions as familiar, such as the absolute value 
function and polynomials (both defined on R). “Size” is a function whose domain 
is the set of finite sets and whose target is N U {0}. 


A.31. Definition. For x € R, the floor |. | is the greatest integer that is at most 
x. The ceiling [x] is the smallest integer that is at least x. A sequence 
is a function f whose domain is N. 


The floor function and ceiling function map R to Z. When the target of a se- 
quence is A, we have a sequence of elements in A, and we express the sequence 
AS a1, a2, a3,..., where a, = f(n). We have used induction to prove sequences 
of statements and to prove formulas specifying sequences of numbers. 

We may want to know how fast a function from R to R grows, particularly 
when analyzing algorithms. For example, we say that the growth of a function 
g is (at most) quadratic if it is bounded by a quadratic polynomial for all 
sufficiently large inputs. A more precise discussion of growth rates of functions 
appears in Appendix B. 
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A.32. Remark. Schematic representation. A function f: A — B is defined on 
A and maps A into B. To visualize a function f: A — B, we draw a region 
representing A and a region representing B, and from each x € A we draw 
an arrow to f(x) in B. In digraph language, this produces an orientation of a 
bipartite graph with partite sets A and B in which every element of A is the 
tail of exactly one edge. 

The image of a function is contained in its target. Thus we draw the region 
for the image inside the region for the target. | 


domain 


To describe a function, we must specify f(a) for each a € A. We can list 
the pairs (a, f(a)), provide a formula for computing f(a) from a, or describe the 
rule for obtaining f(a) from a in words. 
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A.33. Definition. A function f: A — 8 is a bijection if for every b € B there 
is exactly one a € A such that f(a) =b. 
Under a bijection, each element of the target is the image of exactly one 
element of the domain. Thus when a bijection is represented as in Remark 
A.32, every element of the target is the head of exactly one edge. 
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A.34, Example. Pairing spouses. Let M be the set of men at a party, and let W 
be the set of women. If the attendees consist entirely of married couples, then 
we can define a function f: M — W by letting f(x) be the spouse of x. For each 
woman w € W, there is exactly one x € M such that f(x) = w. Hence f isa 
bijection from M to W. | 


Bijections pair up elements from different sets. Thus we also describe a 
bijection from A to B as a one-to-one correspondence between A and B. 
Occasionally in the text we say informally that elements of one set “correspond” 
to elements of another; by this we mean that there is a natural one-to-one 
correspondence between the two sets. 

When A has n elements, listing them as aj, ..., a, defines a bijection from 
In] to A. Viewing the correspondence in the other direction defines a bijection 
from A to [n]. All bijections can be “inverted”. 


A.35. Definition. If f is a bijection from A to B, then the inverse of f is the 
function g: B — A such that, for each b € B, g(b) is the unique element 
x € Asuch that f(x) = b. We write f~—! for the function g. 
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When the target of a function is the domain of a second function, we can 
create a new function by applying the first and then the second. This yields a 
function from the domain of the first function into the target of the second. 


A.36. Definition. If f: A — B and g: B > C, then the composition of g with 
f is a function h: A —> C defined by A(x) = g(f(x)) for x € A. When h is 
the composition of g with f, we write h = go f. 


From the definitions, it is easy to verify that the composition of two bijec- 
tions is a bijection. We use this in Proposition 1.1.24 in verifying for graphs 
that a composition of isomorphisms is an isomorphism. 


h 


a 


COUNTING AND BINOMIAL COEFFICIENTS 
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A discussion of counting quickly leads to summations and products. These 
can be written concisely using appropriate notation. 


A.37. Remark. We express summation using )°, the uppercase Greek letter 
“sigma”. When a and b are integers, the value of os f(i) is the sum of the 
numbers f(i) over the integers i satisfying a <i < b. Herei is the index of 
summation, and the formula f(i) is the summand. 

We write >) <5 f(j) to sum a real-valued function f over the elements of a 
set S in its domain. When no subset is specified, as in }>, x;, we sum over the 
entire domain. When the summand has only one symbol that can vary, we may 
omit the subscript on the summation symbol, as in }) x;. 

Similar comments apply to indexed products using [], which is the upper- 
case Greek letter “pi”. | 


Two simple rules help organizing the counting of finite sets by breaking 
problems into subproblems. These rules follow from the definition of size and 
properties of bijections. 


A.38. Definition. The rule of sum states that if A is a finite set and By,..., B,, 
is a partition of A, then |A| = )>;_, |By|: 
Let T be a set whose elements can be described using a procedure 
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involving steps S;...., 5, such that step S; can be performed in 7; ways, 
regardless of how steps S;,...,5;-1 are performed. The rule of product 
states that |T| = []/_, ri. 


For example, there are g‘ lists of length k from a set of size g. There are 
q choices for each position, regardless of the choices in other positions. By the 
product rule, there are g* ways to form the k-tuple. 


A.39. Definition. A permutation of a finite set S is a bijection from S to S. 
The word form of a permutation f of [n] is the list f(1),..., f(n) in that 
order. An arrangement of elements from a set S is a list of elements of 
S (in order). We write n! (read as “n factorial”) to mean [];_, /, with the 
convention that 0! = 1. 


The word form of a permutation of [n] includes the full description of the 
permutation. For counting purposes we refer to the word form as the permuta- 
tion; thus 614325 is a permutation of [6]. With this viewpoint, a permutation 
of [n] is an arrangement of all the elements of [n]. 


A.40. Theorem. An n-element set hasn! permutations (arrangements without 
repetition). In general, the number of arrangements of k distinct elements 
from a set of sizen is n(n —1)---(#—k +1). 


Proof: We count the lists of k distinct elements from a set S of size n. There is 
no such list when k > n, which agrees with the formula. We construct the lists 
one element at a time, specifying the element in position i + 1 after specifying 
the elements in earlier positions. 

There are n ways to choose the image of 1. For each way we do this, there 
are n — 1 ways to choose the image of 2. In general, after we have chosen the 
first i images, avoiding them leaves n — i ways to choose the next image, no 
matter how we made the first i choices. The rule of product yields Mi (n —i) 
for the number of arrangements. | 


Often the order of elements in a list is unimportant. 


A.41. Definition. A selection of k elements from [n] is a k-element subset of 
[n]. The number of such selections is “n choose k”, written as (/). 


Ifk < Oork > n, then 


0; in these cases there are no selections of k 
elements from [n]. When 0 n 


() 
<k , we obtain a simple formula. 


= 


A.42. Theorem. For integers n, k with 0 <k <n, (") = A] ]ij@—i). 


Proof: We relate selections to arrangements. We count the arrangements of k 
elements from [n] in two ways. Picking elements for positions as in Theorem 
A.40 yields n(n — 1) -(n —k + 1) as the number of arrangements. 
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Alternatively, we can select the k-element subset first and then write it in 
some order. Since by definition there are (7) selections, the product rule yields 
(7k! for the number of arrangements. 

In each case, we are counting the set of arrangements, so we conclude that 
n(n — 1)---(n—k +1) = (k!. Dividing by k! completes the proof. a 


The formula for (/) can be written as CE but the form in the statement 
of Theorem A.42 tends to be more useful, especially when k is small. For exam- 
ple, (5 )= = n(n — 1)/2 and (3) = n(n —1)(n —2)/6, the former being the number of 
edges in a complete graph with n vertices. This form more directly reflects the 
counting argument and cancels the (n — k)! appearing in both the numerator 
and denominator. 

The numbers (7) are called the binomial coefficients due to their appear- 
ance as coefficients in the nth power of a sum of two terms. 


A.43. Theorem. (Binomial Theorem) For n € N, (x + y)” = )-y_ (. )x ae 


Proof: The proof interprets the process of multiplying out the factors in the 
product (x + y)(x+y)---(x+y). To form a term in the product, we must choose 
x or y from each factor. The number of factors that contribute x is some integer 


k in {0,...,n}, and hae remaining n —k factors contribute y. The number of 
terms of the form x‘y"~* is the number of ways to choose k of the factors to 
contribute x. Summing over k accounts for all the terms. 5 
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Using the definition of size and the-composition of bijections, it follows that 
finite sets A and B have the same size if and only if there is a bijection from A 
to B. Thus we can compute the size of a set by establishing a bijection from it 
to a set of known size. 

Simple examples include the statements that a complete graph has (}) 


edges and that therefore there are 2(2) simple graphs with vertex set [n]. Propo- 
sition 1.3.10 uses a bijection to count 6-cycles in the Petersen graph. Exercise 
1.3.32 uses a bijection to count graphs with vertex set [n] and even vertex de- 
grees. Theorem 2.2.3 uses a bijection to count trees with vertex set [n]. 


A.44, Lemma. For n € N, the number of subsets of [n] with even size equals 
the number of subsets of [n] with odd size. 


-Proof: Proof 1 (bijection). For each subset with even size, delete the element n 
if it appears, and add n if it does not appear. This always changes the size by 
1 and produces a subset with odd size. The map is a bijection, since each add 
subset containing n arises only from one even subset omitting n, and each odd 
subset omitting n arises only from one even subset containing n. 

Proof 2 (binomial theorem). Setting x = —1 and y = 1 in Theorem A.43. 
yields )"7_9 ({)(—D* = (-14+1)" = 0. (Note that we proved Theorem A.43 using 
bijections.) | 
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We prove a few identities involving binomial coefficients to illustrate com- 
binatorial arguments involving bijections and the idea of counting a set in two 
ways. We can prove an equality by showing that both sides count the same set. 
A.45, Lemma. (7) = (,",). 

Proof: Proof 1 (counting two ways). By definition, [n] has (7) subsets of size 
k. Another way to count selections of k elements is to count selections of n — k 
elements to omit, and there are (,”,) of these. 

Proof 2 (bijections). The left side counts the k-element subsets of [n], the 
right side counts the n — k-element subsets, and the operation of “complemen- 
tation” establishes a bijection between the two collections. | 


Often, “counting two ways” means grouping the elements in two ways. 
Sometimes one of the counts only gives a bound on the size of the set. In this 
case the counting argument proves an inequality; there are several instances 
of this phenomenon in Chapter 3 (see also Exercise 1.3.31). Here we stick to 
equalities. 

A.46. Lemma. (The Chairperson Identity) k(") = n("7}). 

Proof: Each side counts the k-person committees with a designated chairperson 
that can be formed from a set of n people. On the left, we select the committee 
and then select the chair from it; on the right, we select the chair first and then 
fill out the rest of the committee. 2 a 
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Many students see the next formula as the first application of induction, 
but it also is easily proved by counting‘a set in two ways. 


A.47, Lemma. )7y_, i =“. 


Proof: The right side is ("5"); we can view this as counting the nontrivial inter- 
vals with endpoints in the set {1,..., +1}. On the other hand, we can group 
the intervals by length; there is one interval with length n, two with length 
n — 1, and so on up to n intervals with length 1. a 
Lemma A.47 generalizes to )7j_, {j) = ([1}). To prove this by counting in 
two ways, partition the set of k + 1-element subsets of [n + 1] into groups so 
that the size of the ith group will be (;). 
Finally, a recursive computation for the binomial coefficients. 
A.48. Lemma. (Pascal’s Formula) If n > 1, then (") = (";") + Ca) 
Proof: We count the k-sets in [n]. There are (""’) such sets not containing n 


and (771) such sets containing n. a 


Given the initial conditions for n = 0, which are (0) = 1 and (?) = 0 for 
k 4 0, Pascal’s Formula can be used to give inductive proofs of many statements 
about binomial coefficients, including Theorems A.42—-A.43. 
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A.49. Remark. Multinomial coefficients. Binomial coefficients and the bino- 
mial theorem generalize to multinomials. When )°n; = n, the multinomial 
coefficient (, ” ,,,) is the coefficient of []x;" in the expansion of (So i)". It 
has the value n!/[]n;!. Terms of the form |] x;" arise in the expansion only when 
>_n; =n. Otherwise, there is nothing to count, and we say that one, = 
when on; #n. 

The contributions to this coefficient correspond to n-tuples that are ar- 
rangements of n objects, using n,; copies of object i for each i. Having a copy 
of object i in position j corresponds to choosing the term x; from the jth factor 
(xy +--+ +X). 

The formula n!/[|n;! is derived by counting these arrangements. There 
are n! arrangements of n distinct items. If we view these objects as distinct, 
then we count each arrangement [| n,! times, since permuting the copies of a 
single object does not change the arrangement. 

In Corollary 2.2.4, these arrangements correspond to trees with vertex set 
[n] and specified vertex degrees. When we set x; = 1 for all i, we obtain the 
total number of n-tuples formed from k types of letters over all multiplicities of 
repetition; the result is k”. a 


RELATIONS 
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Given two objects s and r, not necessarily of the same type, we may ask 
whether they satisfy a given relationship. Let S denote the set of objects of the 
first type, and let T denote the set of objects of the second type. Some of the 
ordered pairs (s, t) may satisfy the relationship, and some may not. The next 
definition makes this notion precise. 


A.50. Definition. When S and 7 are sets, a relation between S and T is a 
subset of the product S x T. Arelation on S is a subset of S x S. 


We usually specify a relation by a condition on pairs. In Section 1.1, we 
define several relations associated with a graph G. The incidence relation be- 
tween S = V(G) and T =. E(G) is the set of ordered pairs (v,e) such that 
v € V(G), e € E(G), and v is an endpoint of edge e. The adjacency relation on 
the set V(G) is the set of ordered pairs (x, y) of vertices such that x and y are 
the endpoints of an edge. 


A.51. Remark. Let R be a relation defined on a set S. When discussing several 
items from S, we use the adjective pairwise to specify that each pair among 
these items satisfies R. Thus we can talk about a family of pairwise disjoint 
sets, or a family of pairwise isomorphic graphs. An independent set in a graph 
is a set of pairwise nonadjacent vertices. A set of distinct objects is a set of 
pairwise unequal objects. 

We need the term “pairwise” because the relation is defined for pairs. For 
the same reason, we don’t use “pairwise” when discussing only two objects. 
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When two graphs are isomorphic, we don’t say they are pairwise isomorphic. 
Similarly, we say that the endpoints of an edge are adjacent, not pairwise ad- 
jacent; the adjacency relation is satisfied by certain pairs of vertices. a 


To specify a relation between S and 7, we can list the ordered pairs sat- 
isfying it. Usually it is more convenient to let § index the rows and T the 
columns of a grid of positions called a matrix. We can then specify the rela- 
tion by recording, in the position for row s and column t¢, a 1 if (s,t) satisfies 
the relation and a 0 if (s, t) does not satisfy the relation. Thus the adjacency 
and incidence matrices of a graph are the matrices recording the adjacency and 
incidence relations (see Definition 1.1.17). 

The condition “have the same parity” defines a relation on Z. If x, y are 
both even or both odd, then (x, y) satisfies this relation; otherwise it does not. 
The key properties of parity lead us to an important class of relations. 


A.52. Definition. An equivalence relation on a set S is a relation R on § 
such that for all choices of distinct x, y,z € S, 
a) (x, x) € R (reflexive property). 
b) (x, y) € R implies (y, x) € R (symmetric property). 
c) (x, y) € R and (y,z) € R imply (x, z) € R (transitive property). 
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For every set S, the equality relation R = {(x, x): x € S}is an equivalence 
relation on S. In Proposition 1.1.24 we show that the isomorphism relation is an 
equivalence relation on graphs. The notation G = H for this relation suggests 

“equal in some sense”. 
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A.53. Definition. Given an equivalence relation on S, the set of elements 
equivalent to x € S is the equivalence class containine x. 


The equivalence classes of an equivalence relation on S form a partition 
of S; elements x and y belong to the same class if and only if (x, y) satisfies 
the relation. The converse assertion also holds. If A;,..., Ay, is a partition of 
S, then the condition “x and y are in the same set in the partition” defines an 
equivalence relation on S. 

Parity partitions the integers into two equivalence classes by their rzinain- 
der upon division by 2. This notion generalizes to any natural number. 


A.54. Definition. Given a natural number n, the integers x and y are congru- 
ent modulo n if x — y is divisible by n. We write this as x = y mod n. The 
number n is the modulus. 


A.55. Theorem. For n € N, congruence mod # is an equivalence relation on Z. 


Proof: Reflexive property: x — x equals 0, which is divisible by n. 

Symmetric property: If x = y mod n, then by definition n|(x — y). Since 
y —x = —(x — y), and since n divides —m if and only ifn divides m, we also have 
n|(y — x), and hence y = x mod n. 
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Transitive property: If n|(x — y) and n|(y — z), then integers a, b exist such 
that x — y = an and y —z = bn. Adding these equations yields x — z = an+bn = 
(a + b)n, so n|(x — z). Thus the relation is transitive. | 


A.56. Définition. The equivalence classes of the relation “congruence modulo 
n” on Z are the remainder classes or congruence classes modulo n. The 
set of congruence classes is written as Z, or Z/nZ. 


There are n remainder classes modulo n. For 0 <r <n, the rth class in 
Zn is {kn +r: k € Z}. Numbers a and B lie in the rth class if and only if they 
both have remainder r upon division by n. Thus “m =r mod n” has the same 
meaning as “m is r more than a multiple of n”. 


THE PIGEONHOLE PRINCIPLE 


The pigeonhole principle is a simple notion that leads to elegant proofs 
and can reduce case analysis. In every set of numbers, the average is between 
the minimum and the maximum. When dealing with integers, the pigeonhole 
principle allows us to take the ceiling or floor of the average in the desired 
direction. 


1 book 


A.57. Lemma. (Pigeonhole Principle) F Ifa set consisting of more than kn objects 
is partitioned into n classes, then some class receives more than k objects. 


Proof: The contrapositive states that if every class receives at most k objects, 
then in total there are at most kn objects. a 


The pigeonhole principle can reduce case analysis by allowing us to use 
additional information about an extreme element of a set. This simple idea 
can crop up unexpectedly, but its use can be quite effective. When we find that 
we need the pigeonhole principle, there is no trouble applying it: we need a 
sufficiently big value in our set, and the pigeonhole principle provides it. 

Some applications of the pigeonhole principle are rather subtle. Section 
8.3 presents several of these. The subtlety arises when it is unclear how to 
define the objects and the classes so that the pigeonhole principle will apply. 

Proposition 1.3.15 proves the next proposition using Remark A.13. Here 
we use the pigeonhole principle instead. 


A.58. Proposition. If G is a simple n-vertex graph with 5(G) > (n — 1)/2, then 
G is connected. 


Proof: Choose u, v € V(G). Ifu # v, then at least n — 1 edges join {u, v} to the 
remaining vertices, since 6(G) > (n — 1)/2. There are n — 2 other vertices, so 
the pigeonhole principle implies that one of them receives two of these edges. 
Since G is simple, this vertex is a common neighbor of u and v. 

For every two vertices u,v € V(G), we have proved that u and v are adja- 
cent or have a common neighbor. Thus G is connected. a 
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The pigeonhole principle can also be useful in statements about trees, 
where the number of vertices is one more than the number of edges. If each 
vertex selects an edge in some way, then some edge must be selected twice. 
The idea is to design the selection so that when an edge is selected twice, the 
desired outcome occurs. Applications of this idea occur in Lemma 8.1.10 and 
Theorem 8.3.2. 

The pigeonhole principle is the discrete version of the statement that-the 
average of a set of numbers is between the minimum and the maximum. This 
statement is made explicit for vertex degrees in Corollary 1.3.4. Other applica- 
tions are sprinkled throughout the book. 
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Appendix B 
Optimization and Complexity 


A salesman plans to visit n — 1 other cities and return home. The natural 
objective is to minimize the total travel time. If we assign each edge of K,, a 
weight equal to the travel time between the corresponding cities, then we seek 
the spanning cycle of minimum total weight. This is the famous Traveling 
Salesman Problem (TSP). Seemingly analogous to the Minimum Spanning 
Tree problem, the TSP as yet has no good algorithm. 

Similarly, although we have a good algorithm for finding maximum match- 
ings, we have none for finding the maximum size of an independent set of 
vertices. Since the former is the special case of the latter for line graphs, it is 
not too surprising that it is easier to solve. 


INTRACTABILITY 


We defined a good algorithm (Definition 3.2.3) to be an algorithm that runs 
(correctly) in time bounded by a polynomial function of the input size. One 
algorithm for the TSP considers all spanning cycles and selects the cheapest 
one. This is not a good algorithm, because K,, has (n — 1)!/2 spanning cycles, 
and this has grows faster than every polynomial function of n. The computation 
takes too long for graphs of any substantial size. Practical applications require 
solving TSPs on graphs with hundreds or thousands of vertices. 

No one has found a good algorithm, and no one has proved that none exists. 
The TSP belongs to a large class of problems having the property that a good 
algorithm for any one of them will yield a good algorithm for every one of them. 
A good algorithm for B yields a good algorithm for A if we can “reduce” problem 
A to problem B. 

As an easy example of this, we can use a good algorithm for the TSP (prob- 
lem B) to recognize Hamiltonian graphs (problem A). From a graph G, form an 
instance of the TSP on vertex set V(G) by assigning weight 0 to vertex pairs 
that are edges of G and weight 1 to pairs that are not. The graph G has a 
Hamiltonian cycle if and only if the optimal solution to this instance of the TSP 
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has cost 0. The time for the transformation is polynomial in n(G), so a good al- 
gorithm for the TSP produces a good algorithm to test for spanning cycles. We 
conclude that the TSP is at least as hard as the Hamiltonian cycle problem. 

In the formal discussion, we consider only decision problems, where the 
answer is YES or NO. This makes sense for recognizing Hamiltonian graphs, 
but the TSP is an optimization problem. When formulated as a decision problem 
(called MINIMUM SPANNING CYCLE), the input for the TSP is a weighted 
graph G and a number kf, and the problem is to test whether G has a spanning 
cycle with weight at most k. Repeated applications of this decision problem (at 
most a polynomial number of applications) can be used to find the minimum 
weight of a spanning cycle. Similarly, MAXIMUM INDEPENDENT SET takes 
a graph G and an integer k as input and tests a(G) > k. 

We judge a graph algorithm by its maximum (worst-case) running time 
over inputs on n vertices, as a function of n. The complexity of a decision 
problem is the minimum worst-case running time over all solution algorithms, 
again as a function of the size of the problem.t In describing the growth of a 
function g, we compare it with a reference function f. We define several sets 
of functions in terms of f. The sets O( f) and Q(f).describe functions bounded 
above and below by multiples of f. Functions in ©(/) grow at about the same 
rate as f, those in o( f) grow more slowly, and those in w(f) grow more quickly. 


O(f) = {g: 3c,a € R such that |g(x)| < cl f(x)|forx > a} 
Q(f) = {g: dc,a € R such that |g(x)| > c|f(x)|forx > a} 
@Q(f) = O(fAINA(F) =. 

of) = {g: le@I/IF@)| > 9} 

of) = {g: lg@I/I|f > 9} 


The class of problems with polynomial complexity (solved by a good algo- 
rithm) is called “P”. We have discussed only deterministic algorithms: each 
input leads to exactly one polynomial-time computation. 

Now we consider nondeterministic algorithms. For many decision prob- 
lems with no known good algorithm, short proofs exist for YES answers. For 
example, if we guess the right order of vertices in the HAMILTONIAN CYCLE 
problem (specified by a sequence of O(n logn) bits), then we can verify rapidly 
that this order forms a spanning cycle. 

A nondeterministic polynomial-time algorithm tries all values of a 
polynomial-length sequence of bits simultaneously, applying a polynomial-time 
computation to each guess (polynomial in the length of the input). If any guess 
demonstrates a YES answer to the decision problem, then the algorithm says 
YES. Otherwise, the answer is NO. This amounts to saying that when the an- 
swer is YES, there is a polynomial-time proof of this. The nondeterminism lies 
not in the answer but in the choice of the computation path. 


Technically, the size of a problem instance is its length in some encoding of the prob- 
lem. Measuring the size of a graph problem by the number of vertices suffices for 
our purposes. A polynomial in n is also a polynomial in n? or n°, so the distinction is 
unimportant unless the problem involves exponentially large edge weights. 
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The class of problems solvable by nondeterministic polynomial-time algo- 
rithms is called “NP”. A machine that has the power to follow many computation 
paths in parallel can also follow one; hence PC NP. It is commonly believed 
that P4 NP. This has not been proved, so NP cannot be taken to mean “non- 
polynomial”. Instead, we use the informal term intractable for the problems 
in NP that are essentially as hard as all the problemsin NP. 

A problem is NP-hard if a polynomial-time algorithm for it could be used 
to construct a polynomial-time algorithm for each problem in NP. It is NP- 
complete if it belongs to NP and is NP-hard. If some NP-complete problem 
belongs to P, then P= NP. No polynomial-time algorithm is known for any of the 
many NP-complete problems. This supports the prevailing belief that P4 NP. 
Garey—Johnson [1979] presents a thorough introduction to this topic. 

Given one NP-complete problem, NP-completeness of other problems fol- 
lows by reduction arguments as suggested earlier. We present several such 
arguments in this appendix. Here we list complexities for some of the problems 
discussed in this book. 

Standard style in computer science uses uppercase names for decision prob- 
lems. For a problem whose name includes an optimization, the decision problem 
is testing whether the value is as extreme as a number given as part of the in- 
put. Parameters in the name, poyevenere fixed as part of the statement of the 
problem. 

This is an important distinction. For example, k-INDEPENDENT SET for 
fixed k is in P, since the number of k-sets of vertices is a polynomial in nv of 
degree k when k is fixed, and we can simply test them all for independence. On 
the other hand, MAXIMUM INDEPENDENT SET is NP-complete; this is the 
problem of testing whether G has an independent set of size at least k where k 
is part of the input (and can grow with n). 


Problems in P NP-complete Problems 

k-INDEPENDENT SET MAXIMUM INDEPENDENT SET 
GIRTH (SHORTEST CYCLE) CIRCUMFERENCE (LONGEST CYCLE) 
EULERIAN CIRCUIT HAMILTONIAN CYCLE 

DIAMETER LONGEST PATH, HAMILTONIAN PATH 
CONNECTIVITY 

2-COLORABILITY k-COLORABILITY (for any fixed k > 3) 
MAXIMUM MATCHING A(G)-EDGE-COLORABILITY 
PLANARITY GENUS 


NP-completeness is related to the lack of an easily testable necessary and 
sufficient condition for YES answers. A good characterization is a character- 
ization by a condition checkable in polynomial time. The characterization for 
Eulerian graphs is good, and GIRTH, DIAMETER, and 2-COLORABILITY 
can all be solved in polynomial time using Breadth-First Search. Polyno- 
mial behavior is less obvious for CONNECTIVITY, but min-max relations like 
Menger’s Theorem generally lead to polynomial-time optimization algorithms, 
often based on network flow methods (Section 4.3). 
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HEURISTICS AND BOUNDS 


Our traveling salesman still awaits instruction. NP-completeness does not 
eliminate the need for an answer. We seek heuristic algorithms that find solu- 
tions close to optimal. Perhaps we can prove a guarantee about how far from 
the optimum the result may be. For example, we may be content to have a so- 
lution whose cost is at most twice the optimum, if we have an algorithm that 
can quickly generate such a solution. An approximation algorithm always 
generates a solution whose ratio to the optimum is bounded by a constant.? 

Greediness is a simple heuristic. For the minimum spanning tree problem, 
the result is optimal. On other problems, greedy algorithms may perform very 
badly. Consider MAXIMUM INDEPENDENT SET. We may generate an inde- 
pendent set iteratively by picking a vertex and deleting it and its neighbors. 
How should we pick the next vertex? If we always choose right, then the result 
is a maximum independent set. A greedy heuristic is to pick a vertex of mini- 
mum degree in what remains, since this leaves the largest set of candidates for 
the independent set. The result can be arbitrarily bad. 


B.1. Example. Defeating the greedy algorithm. Consider (K, + Km) V Km. This 
graph has one vertex of degree m, m vertices of degree m + 1, and m vertices of 
degree 2m — 1. The greedy heuristic picks the vertex of minimum degree and 
deletes it and its neighbors, leaving a clique. Hence the greedy algorithm finds 
an independent set of size 2, when in fact a(G) =m. a 


a 
3) 


o 


a 
a 


Nevertheless, the greedy algorithm works well on large graphs generated 
randomly. In this model (see Section 8.5), it almost always finds an independent 
set of size at least half the maximum. Exercise 12 presents two heuristics for 
MINIMUM VERTEX COVER; one fails as in Example B.1, but the other yields 
an approximation algorithm. 

Next we consider simple heuristics for the TSP, where {v1,..., v,} are the 
vertices and w;; denotes the weight (cost) of edge v;v;. From an arbitrary 
starting vertex, it seems reasonable to move to a new vertex via the least- 
cost incident edge. We iteratively move to the closest unvisited neighbor of 
the current vertex. This is a “greedy” algorithm and runs quickly. It is the 
nearest-neighbor heuristic. 


TBetter yet: an approximation scheme is a family of algorithms indexed by a param- 
eter €, say € = 1/k for the kth algorithm, such that the kth algorithm has performance 
ratio bounded by 1+¢. Each algorithm has polynomial complexity, but the degree of the 
polynomial grows with k. Improving the performance ratio requires more time. 
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B.2. Example. Failure of the Nearest-neighbor heuristic. Consider a TSP with 
weight 0 on a Hamiltonian path P, weight n? on all other edges incident to the 
endpoints of P, and weight 1 on all remaining edges. This example has many 
spanning cycles of weight n, but the nearest-neighbor heuristic yields a cycle of 
weight at least n? from any starting vertex. Thus the cost of the cycle produced 
by the algorithm is not bounded by a constant multiple of the optimal cost, and 
it is not an approximation algorithm. a 


There are many similar heuristics. We could try to grow a cycle one vertex 
at a time, greedily absorbing the vertex whose insertion in the cycle causes the 
least increase in cost. This nearest-insertion heuristic has a better chance 
than nearest-neighbor, because at stage i of the nearest-neighbor heuristic we 
make a choice among n — i alternatives, whereas at stage i in nearest-insertion 
we choose among (n — i)i alternatives (which to add and where to insert it). 
Nevertheless, this also is not an approximation algorithm (Exercise 7). 

Another approach is to start with a candidate spanning cycle and try to 
improve it. Maintaining a feasible solution (an actual cycle) and considering 
small changes to improve it is called local search. Allowing changes takes us 
beyond greedy algorithms and may perform better. 

To improve the current cycle, we.consider changing a pair of edges. If 
(v1,..., Un) is our cycle, we could substitute the edges v;0; and vj410j;41 for 
v,Vj41 and v;v;41 to obtain a new cycle=(the other possible switch leads to two 
disjoint cycles instead of one cycle). The switch is beneficial if w, ; + wi41.j41 < 


Wi,i41+W; ;41. The current cycle has (3)— n= ("5’) pairs of nonincident edges to 
consider switching. The algorithm of Lin—Kernighan [1973], which has proved 
remarkably difficult to improve in practice, considers switches among three 


edges at a time. 


Vi Ui41 


a 


The following theorem seems to doom efforts to find an approximation al- 
gorithm for the general TSP. 


B.3. Theorem. (Sahni—Gonzalez [1976]) If there is a constant c > 1 anda 
polynomial-time algorithm A such that A produces for each instance of the 
TSP a spanning cycle with cost at most c times the optimum, then P=NP. 


Proof: We show that such an algorithm A could be used to construct a 
polynomial-time algorithm for HAMILTONIAN CYCLE, which is NP-complete 
(Corollary B.11). Given an n-vertex graph G, construct an instance of TSP on 
the same vertex set by letting w;; = 1 if vjv; € E(G), and w;; = cn otherwise. 
In this instance of TSP, every spanning cycle with cost at most cn has cost 
exactly n and corresponds to a spanning cycle of the original input G. Since A 
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produces a solution with cost at most c times the optimum, it produces one of 
cost n if and only if G has a spanning cycle. 

Thus our polynomial algorithm for HAMILTONIAN CYCLE generates an 
instance of TSP as stated and runs A on this instance. The graph G is Hamil- 
tonian if and only if A produces a cycle of cost n. | 


Approximation algorithms do exist for some special classes of TSP prob- 
lems. To prove that an algorithm is an approximation algorithm, we need a 
lower bound on the optimal solution. 

Let M be the cost of a minimum spanning tree in a weighted graph G. If we 
delete an edge from the optimal cycle in G, we obtain a spanning path. Since 
this is a spanning tree, its cost is at least M. The cost of the optimal cycle is at 
least M plus the minimum cost of an edge not in some tree with cost M. We can 
run Kruskal’s Algorithm for minimum spanning trees to compute this bound. 


B.4. Theorem. On the class of Traveling Salesman Problems where the input 
satisfies the triangle inequality, there is an approximation algorithm that 
finds a spanning cycle with cost at most twice the optimum. 


Proof: Satisfying the triangle inequality means that the matrix of costs satis- 
fies w;,; + wjx > wi for all i, j,k. We-know that the cost of the optimal cycle 
is at least M, where M is the cost of the minimum spanning tree. We use the 
triangle imequaliyy and the minimum spanning tree to obtain a spanning cycle 
with cost at most 2M. 

Begin by duplicating each edge in a minimum spanning tree. As on the 
left below. This makes all degrees even, so there is an Eulerian circuit; the 
circuit has 2n edges and total cost 2M. We successively reduce the number of 
edges without increasing the cost until only n edges remain. By maintaining 
the property that the circuit visits all vertices, we ensure that the circuit at the 
end is a spanning cycle with cost at most 2M. 

If a circuit has more than n edges, then it visits some vertex more than once, 
say via edges v; > v; > uy and v, > v; > vs. Replace the edges u,v; and vjr, 
with v;u,. The result is still a circuit visiting all the original vertices. Further- 
more, the triangle inequality guarantees that the total cost of the edges is no 
larger than before. In the figure below, we orient edges to suggest a particular 
succession of circuits. a 


The algorithm of Theorem B.4 was rediscovered many times. Christofides 
[1976] improved the performance ratio to 3/2. After finding a minimum span- 
ning tree, we needn’t double all the edges to obtain an Eulerian circuit. It 


suffices to add edges to pair vertices that have odd degree in the tree. The re- 
sulting graph has an Eulerian circuit, and the subsequent part of the algorithm 
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produces a spanning cycle with cost at most M plus the cost of the matching. 
To obtain the performance ratio 3/2, it then suffices to show that there is such 
a matching with cost at most half the cost of the optimal cycle (Exercise 8). The 
improvement has value because a minimum weight matching saturating the 
vertices of odd degree in the tree can be found in polynomial time. 


NP-COMPLETENESS PROOFS 


A transformation of problem A to problem B is a procedure that converts 
instances of A into instances of B so that the answer to A on the original in- 
stance is determined by the answer to B on the transformed instance. If we 
have an efficient (polynomial-time) transformation of A to B and an efficient al- 
gorithm for B, then we have an efficient algorithm for A. We say that A reduces 
to or transforms to B. 

If Ais NP-hard and reduces to B by a polynomial-time transformation, then 
B is also NP-hard (a polynomial algorithm for B yields a polynomial algorithm 
for A and hence for all of NP). If also B is in NP, then we say that B is NP- 
complete by reduction from or transformation from A. 

The direction of the reduction is crucial. For example, EULERIAN CIR- 
CUIT reduces easily to HAMILTONIAN CYCLE. Given an input graph G, 
replace each edge with a path of four edges through three new vertices, add 
edges to make the neighbors of each original vertex pairwise adjacent, and 
delete V(G). The graph G is Eulerian if and only if the resulting graph G’ is 
Hamiltonian. Applying an algorithm for HAMILTONIAN CYCLE to G’ would 
determine whether G is Eulerian. This tells use that HAMILTONIAN CY- 
CLE is at least as hard as EULERIAN CIRCUIT, up to a polynomial factor. 
Since EULERIAN CIRCUIT is easy (in P), we learn nothing of use about the 
complexity of HAMILTONIAN CYCLE. 
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The reduction technique requires an initial NP-complete problem. Cook 
[1971] provided SATISFIABILITY as such a problem. SATISFIABILITY takes 
as input a logical formula expressed as a list of clauses; each clause is a collec- 
tion of literals (variables or their negations). A clause is considered true when 
at least one of its literals is true. A formula is satisfiable if there is an as- 
signment of truth values to the variables that makes every clause true. The 
question is whether such an assignment exists. 

Cook proved that for every problem A in NP, an instance of SATISFIA- 
BILITY can be produced in polynomial time from an instance of A such that. 
the answer to the SATISFIABILITY instance is the same as the answer to the 
instance of A. Thus every problem in NP reduces to SATISFIABILITY. 
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This effort need not be repeated for each NP-complete problem. To prove 
that B is NP-hard, we can reduce SATISFIABILITY to B. Any NP-complete 
problem may be reduced to show that B is NP-hard. With more options, in 
principle the proofs become easier to find, but in practice a few fundamental 
NP-complete problems serve as the known NP-complete problem in most NP- 
completeness proofs. 

Starting from SATISFIABILITY, Karp [1972] provided 21 such problems. 
These include many fundamental problems of graph theory, including HAMIL- 
TONIAN CYCLE and MAXIMUM INDEPENDENT SET. It helps to have as 
restrictive a version of an NP-complete problem as possible while still remain- 
ing NP-complete. Since a restricted version has less flexibility, it is easier to 
reduce it to the problem we are trying to prove NP-complete. 

For example, SATISFIABILITY remains NP-complete when restricted by 
requiring that every clause have three literals. The restricted problem is called 
3-SATISFIABILITY or 3-SAT. This is proved NP-complete by considering an 
arbitrary instance of SATISFIABILITY and replacing each clause by an equiva- 
lent collection of clauses with three literals (at the cost of introducing some addi- 
tional variables). 3-SAT is sufficiently restrictive that-many, NP-completeness 
proofs use reduction from 3-SAT. It is easier yet to reduce the more restrictive 
2-SAT, but this is so restrictive that 2-SAT is solvable in polynomial time. 

We start from 3-SAT because the: NP-completeness of SATISFIABILITY 
and the reduction of it to 3-SAT do notinvolve graph theory. Our reductions to 
3-COLORABILITY and DIRECTED HAMILTONIAN PATH follow the presen- 
tation of Gibbons [1985]. 
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B.5. Definition. 3-SATISFIABILITY (3-SAT) 
Instance: A set of logical variables U = {u;} andaset C = {C;} ofclauses, 
where each clause consists of three literals, a literal being a variable u; or 
its negation u;. 
Question: Can the value of each variable be set to True or False so that 
each clause is “satisfied” (contains at least one true literal)? 


B.6. Theorem. (Karp [1972]) 3-SAT is NP-complete. 
Proof: (Exercise 14), a 


B.7. Theorem. (Stockmeyer [1973]) 3-COLORABILITY is NP-complete. 


Proof: In this problem, we are given a graph and asked whether is it 3-colorable. 
If the answer is YES, then there exists a proper 3-coloring, and we can verify in 
quadratic time that the coloring is proper. Hence 3-COLORABILITY is in NP. 
To prove that it is NP-hard, we reduce 3-SAT to 3-COLORABILITY. 

Consider an instance of 3-SAT with variables U = {u;} and clauses C = 
{C;}. We transform this into a graph G that is 3-colorable if and only if the 
instance of 3-SAT is satisfiable. We use the auxiliary graph H below, calling 
{u}, Uo, U3} the inputs and v the Gutput. When we use H in the transformation, 
we will attach it to a larger graph at the inputs, as suggested on the right. 
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We consider 3-colorings using the color set {0, 1, 2}. Every proper 3-coloring of 
H in which the inputs all have color 0 also assigns color 0 to v. On the other 
hand, if the inputs receive colors that are not all 0, then this coloring extends 
to a proper 3-coloring of H in which v does not have color 0. 

From our instance of 3-SAT, we construct a graph G having vertices u; and 
u; for each variable in U, a copy H; of H for each clause C;, and two special 
vertices a, b. For each j, the vertices a, u;,u; form a triangle. For each clause 
C;, the subgraph H; attaches to the graph formed thus far at the vertices for 
the literals in C;. The vertex for the jth literal in C; plays the role of uj in Hj. 
Except for these attachment vertices, the subgraphs H; are pairwise disjoint. 
Finally, the vertex b is adjacent to a and to the output vertex v; for each H;. 
Below we draw the graph G resulting from an instance of 3-SAT having four 
variables and three clauses. 


A satisfying truth assignment for {c;} yields a proper 3-coloring f of G. 
When uy; is true in the assignment, set f(u;) = 1 and f(u;) = 0; otherwise, 
set f(u;) = 0 and f(z;) = 1. For each clause, some literal is true; hence for 
each H; at least one of {u',, u,, u3} has color 1. By our observation about H, we 
can extend f so that each v; has a color other than 0. We complete the proper 
3-coloring by setting f(b) = 0 and f(a) = 2. 

Conversely, suppose that G has a proper 3-coloring f. By renaming colors 
if necessary, we may assume that f(a) = 2 and f(b) = 0. Since f(a) = 2, 
for each variable we have one literal colored 0 and one colored 1. Consider 
the truth assignment in which variable u; is true or false when f(u;) is 1 or 
0, respectively. We claim that this is a satisfying truth assignment. Since 
f(b) = 0, every output vertex v; has nonzero color. By our observation about H, 
the vertices corresponding to the inputs in H; cannot all have color 0. Therefore, 
each clause contains at least one true literal. a 


Exercise 2 extends this to kCOLORABILITY for fixed k > 3, For each k it 
is a special case of CHROMATIC NUMBER, which thus is also NP-complete. 
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B.8. Theorem. (Karp [1972]) INDEPENDENT SET, CLIQUE, and VERTEX 
COVER are NP-complete. 


Proof: A YES answer to the question of whether an input graph G has an 
independent set as large as an input integer k can be verified by exhibiting the 
set and checking (in quadratic time) that its vertices are independent. Hence 
INDEPENDENT SET is in NP. 

Exercise 5.1.31 states that G is m-colorable if and only if GOK,, has an inde- 
pendent set of size n(G). This transformation reduces CHROMATIC NUMBER 
to INDEPENDENT SET. The construction of GO K,, is quadratic in n(G), since 
CHROMATIC NUMBER is trivial ifm > n. We conclude that INDEPENDENT 
SET is NP-hard. 

Since cliques in G are independent in G, CLIQUE and INDEPENDENT 
SET are polynomially equivalent. Since a(G) + B(G) = n(G) (Lemma 3.1.21), 
INDEPENDENT SET and VERTEX COVER are polynomially equivalent. JS 


We next consider problems of traversing graphs via spanning paths and 
cycles. Digraph problems are more general than graph problems, since we can 
model graphs by using symmetric digraphs. Thus, it may be easiest to prove a 
digraph version of a problem NP-hard and then obtain the graph version by a 
simple restriction. 
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B.9. Definition. Given vertices x, y i a digraph D, the DIRECTED HAM- 
ILTONIAN PATH problem asks whether D has a spanning x, y-path. 


B.10. Theorem. (Karp [1972]) DIRECTED HAMILTONIAN PATH is NP- 
complete. 


Proof: A spanning x, y-path in a digraph D can be verified in linear time. Thus 
DIRECTED HAMILTONIAN PATH is in NP. To show that it is NP-hard, we 
reduce VERTEX COVER to DIRECTED HAMILTONIAN PATH. 

Consider an instance of VERTEX COVER, consisting of a graph G and an 
integer k; we want to know whether G has a vertex cover of size (at most) k. 
We construct a digraph D such that D has a Hamiltonian path if and only if G 
has a vertex cover of size at most k. We index the edges incident to each vertex 
arbitrarily. When the edge e = uv is the ith edge incident to u and the jth edge 
incident to v, we write e;(u) = e = e;(v). 

To build D, we start with k+1 special vertices zo,..., z,. Foreach v € V(G), 
we add a path P(v) = v4,..., ve, to D, where r = dg(v). We add edges from 
each of zo,..., 2-1 to the start of each P(v) and from the end of each P(v) to 


each of z1,..., 2. Also, for each edge e = e;(u) = e;(v) in E(G), we create the 
edges U9j-102j—-1, V2j—1Ugi-1, UajVe;, and v2g;U9;. 
Suppose that G has a vertex cover of size k, consisting of vertices v},..., v*. 


We form a Zo, z.-path in D by following zo, P(v'), z1,..., P(v*), z. This path 
omits all of P(u) for each uncovered vertex u. We absorb these vertices in pairs, 
absorbing each pair u2;_1u42; by making a detour from the path P(v) for the ver- 
tex that covers the edge uv = e;(u) = e;(v). The detour is vg;_1u2;-1u2;v2;, as 
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shown below. Because the vertices v2;_1, v2; are associated with only one edge, 
each such detour is requested at most once. After implementing all the detours, 
we have a Hamiltonian Zo, z,-path in D. 


Ugj;-2 Uaj-1 uo; UQj+1 ugj-2  U2j-1 U9; UQj+1 
e — oo o—_»>—_ 


v2j-2 V2j-1 v2; V2j+1 v2j-2  U2j-1 v2; V2j+1 

Conversely, suppose that there is such a path Q. Note that every vertex 
of P(u) except the first and last has indegree and outdegree 2, for each u € 
V(G). We show first for i > 1 and u € V(G) that uog;u9j41 € E(Q) if and only 
if ug;-9ua;-1 € E(Q), where for i = 1 we take up to mean some element of {z,}. 
Similarly, when i = d(u) we take u9;,1 to mean some element of {z,} Since the 
ith edge incident to u is well-defined, we can let the vertex v and index j be 
such that e;(u) = e;(v). 

If u9;-2uo;-1 ¢ E(Q), then Q must enter ug;-1 from v2;_1. This implies that 
Q can only leave ug;-1 on u9;-1u2; and can only enter v2; on u9v2;. This in turn 
implies that U2;UI;41 ¢ E(Q). 

If ug;-22;-1 € E(P), then P cannotJeave ve;_1 on v2;-149;-1 and must leave 
Vgj-1 ON v2g;-1V2;. This implies that Q € enters v2; ON v2j_102; j and not on u9;02;. 
Hence Q does not leave u9;'on u9;v2; and must leave Q on u9ju9;+1. (In this case, 
Q may include {ug;-149;, vaj—2V2;—-1, V2) v9i41} or {uo;-1V2;—-1, V2;42;}.) 

Now let S = {v € V(G): z;v1 € Q}; these are the k vertices in G whose initial 
copies are entered from zo,..., 2-1 by Q. Our argument shows for each edge 
uv that u ¢ S implies v € S. Hence S isa vertex cover, and we have the desired 
reduction of VERTEX COVER to DIRECTED HAMILTONIAN PATH. a 


B.11. Corollary. (Karp [1972]) DIRECTED HAMILTONIAN CYCLE, HAMIL- 
TONIAN PATH, HAMILTONIAN CYCLE are NP-complete. 


Proof: All these problems are in NP. To reduce DIRECTED HAMILTONIAN 
PATH to DIRECTED HAMILTONIAN CYCLE, add one vertex z and edges vz 
and zu to an instance requesting a spanning u, v-path in D. 

The reduction of HAMILTONIAN PATH (with specified endpoints) to 
HAMILTONIAN CYCLE is the same. . 

To reduced DIRECTED HAMILTONIAN PATH to HAMILTONIAN PATH, 
consider an instance that requests a u, v-path in D. To form an instance G of 
HAMILTONIAN PATH, first split each vertex x into a path x~, x°, xt. Let x7 
inherit all edges with heads at x, and let x* inherit all edges with tails at x. A 
spanning u, v-path in D becomes a spanning u_ , v*-path in G by replacing each 
vertex x with the sequence x~, x°, x*. 

Conversely, since a spanning u~, v*-path in G must visit each. x°, it must 
visit traverse all sequences of the focni x, x°, x+, forwards or backwards. Since 
no vertices of the same sign are adjacent, these traversals must all be in the 
same direction, and then they collapse to the desired u, v-path in D. Thus G 
has a spanning u~, v*-path if and only if D has a spanning u, v-path. a 
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Exploration of the boundary between P and NP-complete seeks more re- 
stricted problems that remain NP-complete and larger classes of inputs on 
which there are polynomial-time solution algorithms. The former provides eas- 
ier NP-completeness proofs and places limits on the successes of the latter type. 
The latter aim is that of extending the applicability of good algorithms. 

We illustrate this process by proving that 3-COLORABILITY remains NP- 
complete for planar graphs. 


B.12. Theorem. (Stockmeyer [1973], see also Garey—Johnson—Stockmeyer 
[1976]) PLANAR 3-COLORABILITY is NP-complete. 


Proof: As usual, it is easy to verify that a 3-coloring is proper. We reduce 3- 
COLORABILITY to PLANAR 3-COLORABILITY. Given an arbitrary graph G, 
we construct a planar graph G’ such that G’ is 3-colorable if and only if G is 
3-colorable. 

Consider a drawing of G in the plane. We replace each crossing by a planar 
“gadget” that has the effect of propagating color across the crossing in a 3- 
coloring. In every proper 3-coloring of the graph H below, u and v have the 
same color, as do x and y (Exercise 17). The partial edges indicate where copies 
of H will be linked to the rest of the graph. 


In the drawing of G, each edge involved in k crossings is cut into k + 1 
segments by the crossings. On each segment, add a new vertex. Replace each 
crossing by a copy of H attached by its terminals to the new vertices on the four 
segments incident to the crossing. Finally, for each original edge wz, choose one 
endpoint and contract the edge between it and the vertex on the segment of wz 
incident to it. An edge involved in no crossings returns to its original state. 


In a proper 3-coloring of the new graph G’, the propagation of color across 
the gadgets requires the endpoints of each original edge to have different colors. 
Hence restricting this coloring to the original vertices yields a 3-coloring of G. 

Conversely, given a proper 3-coloring of G, we can start along each edge 
from the endpoint used in a copy of H and propagate the colors to properly 
3-color G’. We can do this because H has both a proper 3-coloring in which 
u, v, x, y all get the same color and a proper 3-coloring in which x and y have a 
different color from u and v (Exercise 17). a 
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HAMILTONIAN CYCLE also remains NP-complete for planar graphs. In- 
deed, it remains NP-complete for graphs that are planar, 3-regular, 3-connected, 
and have no face of length less than 5 (Garey—Johnson—Tarjan [1976]). Also it is 
NP-complete for bipartite graphs (Krishnamoorthy [1975]) and for line graphs 
(Bertossi [1981]; see also Exercise 14). The restriction of 3-COLORABILITY 
to line graphs is the same as 3-EDGE-COLORABILITY for graphs; this is NP- 
complete even for 3-regular graphs, by reduction from 3-SAT (Holyer [1981]). 


EXERCISES 


B.1. (—) Using algorithms developed earlier in this text, describe a good algorithm to 
compute a(G) when G is bipartite. 


B.2. (!) Use Theorem B.7 to prove that kCOLORABILITY is NP-complete for each fixed 
value of k that is at least 3. 


B.3. Give a polynomial-time algorithm for 2-COLORABILITY. 


B.4. Prove that HAMILTONIAN CYCLE and HAMILTONIAN PATH dre polynomially 
equivalent. That is, show that a polynomial-time algorithm for either one can be used 
to obtain a polynomial-time algorithm for the other. 


B.5. Testing for a cycle of fixed length k in an input graph with n vertices can be done 
in time bounded by a multiple of k!n": look at each of the (7) vertex subsets of size k in 
turn and test all possible orderings. SinceX is a constant, this is polynomial time. For 
a 4-cycle, this runs in time O(n*). Devise an algorithm that will test for the presence of 
a 4-cycle in time O(n”). (Richards—Liestman [1985]) 


B.6. Given a graph G and an integer k, the MINIMUM DEGREE SPANNING TREE 
problem asks whether G has a spanning tree T such that A(T) < k, and the LONGEST 
PATH problem asks whether G has a path of length at least k. In the k-PATH problem, 
k is not part of the input, and the question is whether G has a path of length at least k. 
a) Prove that MINIMUM DEGREE SPANNING TREE and LONGEST PATH are 
NP-complete. 
b) Prove the k-PATH is in P for each fixed k. 


B.7. Construct a family of examples to prove that the performance ratio of the nearest- 
insertion heuristic for the TSP is not bounded by any constant. 


B.8. (!) Consider an instance of TSP where the costs satisfy the triangle inequality. 
Prove that some matching of the odd-degree vertices in a minimum spanning tree uses 
edges with total cost at most half the minimum cost of a spanning cycle. Conclude that 
Christofides’ algorithm has performance ratio at most 3/2. 


B.9. Prove that in order to solve the TSP exactly, it suffices to have an algorithm that 
solves the TSP when the edge weights satisfy the triangle inequality. (Hint: Given an 
arbitrary instance of the TSP, produce in polynomial time an instance of the TSP where 
the edge weights satisfy the triangle inequality (w;; + w;, > w;,) and the set of optimal 
tours is the same as in the original instance.) 


B.10. Prove that 2-SAT belongs to P. 
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B.11. A road system has one snowplow. The narrow city streets must be plowed; the 
plow can clear such streets completely in one traversal. There are also rural roads that 
the plow can use to change position, but these don’t need to be plowed. Thus we havea 
weighted graph with edges of two types; those of type 1 must be traversed, those of type 
2 need not be. The state wants an algorithm that will find a minimum-length circuit 
traversing all type 1 edges in such a graph. Prove that this problem is NP-hard, by 
reduction from the HAMILTONIAN CYCLE problem. 


B.12. (!) Heuristic algorithms for vertex covering. Algorithm 1: include a vertex of 
maximum remaining degree, delete, iterate until the remaining graph is a stable set. 
Algorithm 2: choose an arbitrary edge, include both endpoints, delete them, iterate 
until the remaining graph is a stable set. The heuristic in Algorithm 1 may seem more 
powerful, but Algorithm 2 has a better performance guarantee! 

a) Prove that algorithm 2 always produces a vertex cover with size at most twice 
the minimum size £(G). 

b) Prove that algorithm 1 may produce a vertex cover of size about log 8(G) times 
the minimum. (Hint: Construct a bipartite graph G fer which Algorithm 1 chooses 
about £(G)/i vertices of degree i for each 1 <i < B(G).) 


B.13. (+) A graph G is a-critical if a(G — e) > a(G) for every e € E(G). Prove that a 
connected a-critical graph has no cut-vertex. (Hint: If e;:e2 are edges incident to a cut- 
vertex x, use the maximum independent sets in G — e and G — ee to build an independent 
set-in G with more than a(G) vertices.) 


B.14. SATISFIABILITY differs from 3-SAT in that clauses may have arbitrary size. 
Prove that a clause containing more than three literals can be replaced by clauses with 
three literals (allowing the addition of a few new variables) so that the original clause 
is satisfiable if and only if the new instance of 3-SAT is satisfiable. Cunclude that 
SATISFIABILITY reduces to 3-SAT. (Karp {1972]) 


B.15. (!) Given that HAMILTONIAN CYCLE is NP-complete for 3-regular graphs, prove 
that COVERING CIRCUIT is.NP-complete. This is the question of whether the input 
graph G contains a closed trail that includes at least one endpoint of every edge. Prove 
that the line graph of G is Hamiltonian if and only if G hag a covering circuit. Conclude 
that HAMILTONIAN CYCLE is NP-complete for line graphs. 


B.16. The DOMINATING SET problem considers an input graph H and an input inte- 
ger k and asks whether H has a dominating set of size at most k (Definition 3.1.26). 

a) Given a graph G, let G’ be the graph obtained by adding an extra copy of each 
edge of G and subdividing one copy of each edge (thus each edge is replaced by a trian- 
gle involving a new vertex). Prove that G has a vertex cover of size at most k if and only 
if G’ has a dominating set of size at most k. 

b) Use the NP-completeness of VERTEX COVER to prove the NP-completeness of 
DOMINATING SET. 


B.17. Prove the claims made in Theorem B.12 about 3-colorings of the gadget H. 


B.18. (*«) Use HAMILTONIAN PATH in directed graphs to prove that 3-MATROID 
INTERSECTION is NP-complete. 


B.19. (*) Use 3-D MATCHING to prove that 3-MATROID INTERSECTION is NP- 
complete. Given a collection of triples of the form (x1, x2, x3) with x; € V; and sets 
V,, Ve, V3 disjoint, 3-D MATCHING is the problem of finding the maximum number of 
triples such that each element appears in at most one of the triples selected (by compar- 
ison, bipartite matching is the 2-D matching problem with the sets being pairs). 
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Appendix C 


Hints for Selected Exercises 


In this appendix we provide some general guidelines and some specific 
suggestions for selected exercises. This should help students who have trouble 
getting started in finding and writing proofs. 


GENERAL DISCUSSION 


The first step is making sure that one understands exactly what the prob- 
lem is asking. Some problems request a verification of a mathematical state- 
ment. Definitions may provide a road map for what needs to be verified. Some- 
times, the desired statement follows from a theorem already proved, and then 
one needs to verify that its hypotheses hold. 

Other problems may involve some experimentation to discover the math- 
ematical statement that needs to be proved. Sometimes one examines small 
cases to discover a general pattern and then proves that pattern by induction. 
In other problems, exploration of examples can help one understand why the 
claim is true. 

Understand definitions and use them carefully. A disconnected graph need 
not have an isolated vertex. Loops and cycles are different concepts. Under- 
stand the difference between maximal and maximum. A vertex cover is a set 
of vertices, and an edge cover is a set of edges. A graph with connectivity 3 is 
2-connected, and a 3-chromatic graph is 17-colorable. 

When seeking a direct proof of a conditional statement, one can work from 
both ends. List statements that follow from the hypothesis. List statements 
that suffice to imply the conclusion. When some statement appears in both 
lists, the problem is solved. 

When unsuccessful with the direct method, list what would follow if the 
conclusion were not true. If something in this list contradicts something in the 
list of statements that follow from the hypothesis (or other known true state- 
ments), then again the problem is solved, using the method of contradiction. 
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Contradiction is particuiarly appropriate for statements of impossibility. 
To prove that something exists, often one can construct an example and prove 
that it has the desired properties; this is the direct method. 

Most conditional statements can be interpreted as universally quantified 
statements. The proof of a universally quantified statement must be valid for 
every value of the variable in the given universe. Examples can help one un- 
derstand or explain a proof, but an example by itself does not provide a proof. 
Explaining why the example has the desired property, in language that applies 
to all possible examples, can lead to the proof of the desired statement. 

Induction often works to prove statements that have a natural number 
parameter. Beware of the induction trap |—(Example 1.3.26, Example A.28). 
Remember the template for proving conditional statements by induction (Re- 
mark 1.3.25). Exploration of small cases can help one understand either the 
statement to be proved or the way to go from one value of the parameter to the 
next, but such discussion is not part of the final proof except as needed in the 
basis step. 

Other techniques include extremality and the pigeonhole principle. Some- 
times one considers a smallest counterexample to a desired statement and then 
uses its existence to obtain a smaller counterexample. This can be viewed as 
induction or contradiction or extremality. 

It may not be obvious what technique works in a particular problem. Some- 
times many different techniques work and produce different proofs. Mathe- 
maticians find proofs by working hard; both stubbornness and flexibility are 
virtues. One tries all imaginable techniques to solve a problem. Practice in- 
creases understanding and speed in finding proofs. 

The final step is to produce a careful and complete exposition of the solu- 
tion. Writing out a proof can reveal hidden subtleties or cases that have been 
overlooked. It can also expose thoughts that turn out to be irrelevant. Pro- 
ducing a well-written solution often involves repeated revision. It is helpful 
to write a solution early, put it aside, and read it again before submission to 
see whether it is still complete, convincing, and comprehensible. The process 
of writing solutions helps develop a useful skill: the ability to express oneself 
concisely, clearly, and accurately. 


SUPPLEMENTAL SPECIFIC HINTS 


1.1.14. Contradiction often helps with a nonexistence conclusion. Suppose that such a 
decomposition exists; what conclusions can be drawn about the board? 


1.1.25. Another nonexistence conclusion. Assume that a 7-cycle exists, and use the 
properties of the Petersen graph to obtain a contradiction. 


1.1.26. Consider an edge in G (one can also start with a vertex). 
1.1.27. Start with a vertex in G. 


1.1.29. Consider the acquaintances or nonacquaintances (whichever set is larger) of 
one particular person. 
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1.1.32. Consider the parity of the sizes of the partite sets. 


1.1.34. Since the three subgraphs are to be pairwise isomorphic, it will help to examine 
a drawing of the graph that has three-fold symmetry. 


1.1.37. Compare contributions from the ends of the paths with contributions from the 
internal vertices. 


1.1.38. Get a decomposition from a bipartition and a bipartition from a decomposition. 


1.2.15. In this problem the starting vertex is specified, so having the first and last edge 
be the same is not enough. 


1.2.17. Use the transitivity of the connection relation. 
1.2.18. Draw G for small values of n to see what the answer should be; then prove it. 


1.2.19. For the upper bound, use the fact that when a and b are relatively prime, there 
are integers p and q such that pa+qb = 1. 


1.2.26. The characterization of bipartite graphs makes this easy. 
1.2.28. The problem does not restrict attention to induced subgraphs. 
1.2.38. Use induction and Lemma 1.2.25. 


1.2.40. If P and Q are disjoint, then consider a shortest path from V(P) to V(Q), and 
obtain a contradiction. 


1.3.12. It may help to construct the example first in the case k = 1 and then generalize. 
1.3.15. For part (b), consider the complements. 


1.3.18. Assuming that e is a cut-edge and'H is a component of G — e, count the edges 
of H from the viewpoint of each partite sgt. Setting these formulas equal leads to a 
contradiction. 


1.3.19. For the second part, make the desired graphs correspond to 3-regular graphs. 


1.3.22. In part (a), what happens if an outside vertex has three neighbors in V(C)? 
For part (b), part (a) provides one bound on the number of edges between V(C) and 
V(G) — V(C); the hypothesis on minimum degree provides another. 


1.3.28. When k is even, exhibit an isomorphism. When k is odd, find an odd cycle in Q,. 
1.3.31. For part (a), consider Example 1.3.18. 

1.3.33. For part (a), establish a one-to-one correspondence between the nonneighbors 
of x and the pairs of neighbors of x. 

1.3.34. Consider two adjacent vertices x, y, and establish a one-to-one correspondence 
between N(x) and N(y). 

1.3.43. For the construction, a regular graph can’t work. Vertices of high and low 
degrees are needed, but both must be adjacent to high-degree vertices. 


me 


1.3.50. Construct for each n an example with few edges, and use induction on n to prove 
that it is optimal. The degree-sum formula implies that an n-vertex simple graph with 
fewer than 3n/2 edges has a vertex of degree at most 2. 

1.3.53. Define a graph to model pairs of people who can still play together. What is the 
condition that permits an additional game to be played? 

1.3.55. For part (a), show first that A(G) > n(G)/2. For part (b), show that G must be 
disconnected. 

1.3.57. Keep the paradigm of Remark 1.3.25 in mind. 

1.3.63. Any inductive proof of sufficiency must verify that the “smaller object” satisfies 
the condition before the induction hypothesis can be applied. 
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1.4.16. For part (a), keep the definition of / in mind. 
1.4.23. The shortest proof uses a graph transformation. 


1.4.25. Given an orientation with more than two vertices of odd outdegree, make an 
appropriate alteration. 


1.4.29. Use the strong connectedness of D and the reference odd cycle of G to build an 
odd cycle in D. 


1.4.34. Show that in the subgraph F of G consisting of edges oriented oppositely in 
H, indegree equals outdegree at every vertex. Bring G closer to H by finding a 3-cycle 
reversal involving a vertex of maximum outdegree in F. 


1.4.37. Use strong induction on the order of the tournament. 


1.4.38. In one direction, show that there is such a tournament with n vertices if there 
is one with n — 2 vertices. Be careful about n = 6. 


2.1.2. In part (b), the statement includes the possibility of adding multiple copies of 
‘edges already present. 


2.1.17. Compare with the proof of A>B,C in Theorem 2.1.4. 
2.1.25. Use induction on n; for the induction step, delete a leaf. 


2.1.27. Keep the quantifier in mind. Prove that the condition on the list of numbers is 
both necessary and sufficiency for the existence of a ‘tree with vertex degrees dj, ..., dy. 
Two implications must be proved. 


2.1.29. Count the edges in two ways. 
2.1.31. Consider the contrapositive. 

2.1.33. Two implications must be proved. = 
2.1.34. Use induction on n. 


2.1.40. Express G as a union of the right number of paths. What can be done if they 
are not pairwise intersecting? 
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ach concerns a connected n-vertex graph. 
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2.1.41. Use a spanning tree of some component. 

2.1.47. In part (a), remember that the union of a u, v-path and a v, w-path need not be 
au, w-path. 

2.1.59. In the problem, bothn and k are fixed. The answer must be given in terms of 
these two parameters. 


2.1.61. Form G’ from G — x — y by adding k disjoint edges joining Ng(x) to Nc(y). 
2.2.5. Consider for each 5-cycle the number of edges that will be used. 

2.2.7. By symmetry, each edge of K, lies in the same number of spanning trees of K,. 
2.2.9. Use the Priifer code. 

2.2.19. In a tree with vertex set [n], cut the edge at vertex n on the path from n to 1. 
2.2.24. Build the edges in decreasing order of the difference between their endpoints. 
2.2.29. Ifa tree is not a caterpillar, then it contains the tree Y of Example 2.2.18. 
2.2.33. Consider the set of vertices reachable by paths from the root. 


2.3.11. What happens if a minimum-weight spanning tree contains an edge of weight 
larger that the maximum weight in a bottleneck spanning tree? 


2.3.13. Consider the heaviest edge of T’ among those not in T. 


2.3.31. In the induction step, partition the set of words in an optimal code into two sets 
according to the first bit. 
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3.1.8. Consider the symmetric difference of two perfect matchings. 


3.1.9. Use vertex covers, or compare a maximal matching with a maximum matching 
using symmetric difference. 


3.1.16. Use induction on k. 
3.1.24. Transform this into a graph problem. 

3.1.25. Find one appropriate permutation matrix and then adjust what remains by a 
constant factor to apply the induction hypothesis. 

3.1.26. Use Corollary 3.1.13 for part (a) and induction on n for part (b). 

3.1.29. ‘What does the Kénig—Egervary theorem say about the vertex cover problem 
when G has no matching of size k. 

3.1.30. This requires a bound and an example that achieves the bound. 

3.1.39. Consider the edges joining a maximum independent set to its complement. 
3.2.11. Suppose that the first occurrence of such a rejection in the Proposal Algorithm 
is a rejecting x even though xa is a pair in some stable matching M. If a rejects x for y, 
note that y must be paired with some women b in M. What can be deduced about the 
preferences of these people? 

3.3.2. Vertex cover is not strong enough to prove optimality of the matching. 

3.3.7. This is easy when k is even. For odd k, construct an example for k = 3 and 
generalize it. 

3.3.11. Use Tutte’s Theorem. 
3.3.12. To prove that there is a matching syhose size is the weight of some generalized 
cover, let T be a maximal set of vertices maximizing the quantity o(G — T) — |T| (the 
deficiency). 


3.3.14. Show for each S C V(G) that og S) — |S| is sufficiently small. A different 
approach is to let X be the k-vertex set inducing the fewest edges and prove that there 
is a matching from X into X. 


3.3.16. Generalize the argument of Corollary 3.3.8. 

3.3.17. Given adjacent vertices x, y, verify Tutte’s condition for G — x — y. 

3.3.18. Think of an appropriate size of a Tutte set S and an appropriate number of odd 
components in G — S. 

3.3.19. Use Petersen’s Theorem to obtain a 1-factor. Assemble copies of P, by consid- 
ering a consistent orientation of the cycles in the remaining 2-factor. 
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4.1.5. Show that G’ is connected and has no cut-vertex. There is also a short proof 
using internally disjoint paths: 

4.1.10. Theorem 4.1.11 and Corollary 1.3.5 yield a short proof. 

4.1.14. Prove the contrapositive. 

4.1.17. Show first that if |[5, 5]| = 3, then S and S have odd size. 

4.1.18. For the first part, show that the subgraph induced by the smaller side of an 
edge cut with at most two edges has too many edges to avoid having a triangle. 

4.1.23. Verify Tutte’s Condition. Keep in mind that the forbidden condition is large 
stars as induced subgraphs, not just any large stars. 


4.1.26. Necessity is proved by following individual cycles. For sufficiency, define an 
appropriate auxiliary graph whose vertices are the components of G — F. Show that’ 
this graph is bipartite to obtain the desired partition of the components of G — F. 
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4.1.27. Recall that an edge cut is a bond if and only if on each side of the cut the vertices 
induce a connected subgraph. 


4,2.6. Use ear decomposition. 


4.2.14, This is essentially an edge version of Theorem 4.2.2, with the added coriclusion 
that the common points happen in the same order on the two paths. 


4.2.21. What do we know about connected graphs having at most two vertices of odd 
degree? 

4.2.23. From a given bipartite graph G, design a graph H so that applying Menger’s 
Theorem to H will yield the needed result on G. 


4.2.28. Use the Expansion Lemma and Menger’s Theorem. 


4.3.13. Design a network so that there is an assignment of participants to groups if 
and only if the network has a flow of value )\ m;. Use the Ford—Fulkerson Theorem to 
express the condition for such a flow in terms of cuts. Show that the given condition on 
the data is equivalent to that condition on the source/sink cuts. 


5.1.20. One approach is to prove the contrapositive. Another is to delete an odd cycle. 


5.1.22. An ordering is needed so that when each vertex is encountered, it has at most 
two neighbors among the earlier vertices. 


5.1.23. The lower bounds involve counting and/or the pigeonhole principle. The inter- 
esting part is providing a construction to show that the graph is k + 2-colorable with 
k + 1 does not divide n. = 
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5.1.26. (also next problem) Obtain a clique and a proper coloring of the same size. 


5.1.30. Given a proper r-coloring of G,, produce a collection of distinct subsets of the 
colors, each subset associated with an element of n. Conversely, show how to use such a 


of 


collection to produce a proper coloring.  — 


5.1.31. From a proper m-coloring, build an independent set of the same size. From a 
big enough independent set, construct a proper m-coloring. 

5.1.32. As in Theorem 1.2.23, one can use induction or encode the colors as binary 
k-tuples and apply the pigeonhole principle. 

§.1.39. Obtain a quadratic inequality for k in terms of m by using upper and lower 
bounds on e(G). 

5.1.41. Using the induction hypothesis, a new graph can’t violate the bound unless we 
delete a vertex and find that the chromatic numbers of both the graph and its comple- 
ment decrease. Is it possible for this to happen when these chromatic numbers already 
sum to the maximum? 

5.1.44. Obtain the upper bound from part (a). For the lower bound, use the same 
orientation as in Theorem 5.1.21. 

5.1.51. Modify some proper k-coloring so that it becomes a proper k + 1-coloring that 
has the pre-specified values on the vertices of P. 


5.2.2. Consider the complement. 


5.2.9. Since G’ is connected, it suffices to consider deletion of edges from G’; see Remark 
5.2.12. 


5.2.15. Use large neighborhoods as color classes while there remain vertices of high 
degree; then apply Brooks’ Theorem. 


5.2.17. For part (b), consider complements. 
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5.2.19. Modify K, + K,-, to obtain 7,,, counting the changes in the number of edges. 


§.2.21. The proof of Theorem 5.2.9 transforms a graph with nor + 1-clique into an r- 
partite graph with at least at many edges. The number of edges strictly increases unless 
equality holds in each step of the computation. What is needed for equality to hold? 


5.2.27. For the upper bound, deleting the edges of a cycle from a counterexample leaves 
a forest, which restricts the girth. Reducing to the case 5(G) > 3 now requires n < 8, 
which contradicts Exercise 5.2.26. 


5.2.28. For the upper bound, reduce to the case 5(G) > 3, and consider a shortest cycle 
C. Deleting V(C) leaves a forest, and its leaves have at least three neighbors on C. 


5.2.29. For part (b), if the largest and smallest color classes differ in size by more than 
1, use part (a) to alter the coloring appropriately. 


5.2.32. Part (a) uses the properties of k-critical graphs for G and H. In part (c), one 
can give explicit examples with orders 4, 6, 8 and then apply part (a). 


5.2.40. To compute the chromatic number, consider independent sets. To forbid sub- 
divisions of complete graphs, consider vertex cuts; a subdivision of K, must have k — 1 
pairwise internally disjoint paths joining two of its vertices of degree k — 1. 


5.2.43. Use induction on k. In the induction step, discard an appropriate subset of V(G). 
5.2.44. Reduce to the case 6(G) > k — 1 and then use induction on k. 

5.3.3. If x(G;k) = k* — 4k° + 3k?, then how many edges and vertices does G have? 
5.3.4. For part (a), use the chromatic recurrence or Theorem 5.3.10. 


5.3.6. Explain how contributions to the coefficient of k"©! arise in computing 
wera Pr (Gk) 


5.3.12. For part (a), use the chromatic recurrence. For part (b), consult Exercise 1.3.40 
for the maximum number of edges in an n-vertex graph with r components. 
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5.3.18. Part (a) implies that part (b) needs only one computation. Expressing a chro- 
matic polynomial as the sum of two chromatic polynomials involves addition of an edge 
as in Example 5.3.9. 


5.3.23. Use a simplicial elimination ordering. 


5.3.26. For part (a), use a simplicial elimination ordering; the simplicial vertex may or 
may not bein GN H. For part (b), N(x) may or may not equal V(G) — x. 


5.3.28. Build a simplicial elimination ordering of G and a transitive orientation of G. 


6.1.20. Which plane graphs have only one face? When a plane graph has more than 
one face, what kind of edge can be deleted to reduce the number of faces? 


6.1.24. In the induction step, delete an edge of a cycle. 

6.1.25. Use Euler’s Formula. 

6.1.28. This can be proved by applying Euler’s Formula to an appropriate planar graph. 
Also it can be proved using induction. 

6.1.30. Mimic the proof of Theorem 6.1.23. 

6.2.6. In order to apply the claim and the induction hypothesis, find an appropriate 
vertex to delete from a larger graph. 

6.2.7. Given the graph G to be tested, construct a graph H such that G is outerplanar 
if and only if H is planar. Kuratowski’s Theorem then applies to H. 

6.2.8. Several cases need to be considered concerning how a subdivision of K5 might 
be arranged in the graph. 
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6.2.9. For the construction, start with n = 5. Reduce the upper bound to considering 
planar graphs with 2n edges that have minimum degree at least three and contain a 
triangle. In each case, obtain disjoint cycles or a subdivision of K3 3. 


6.2.11. Let H’ be a subgraph contractible to H. Prove that if H is not all of H’, then- 
H' has an edge incident to a vertex of degree 2. Use this in an inductive proof. 


6.3.5. Use the Four Color Theorem. 


6.3.9. When C has length 4, replacing the inside (or outside) with a single edge between 
opposite vertices of C allows one to obtain a 4-coloring of a C-lobe that uses distinct 
colors on two opposite vertices of C. 


6.3.12. For the construction, use groups of three vertices to build ‘alcoves” such that 
no guard can see into more than one alcove. 


6.3.20. Show that the lower bound resulting from Proposition 6.3.10 is at least r/2 
when s > (r — 2)?/2, and provide a decomposition into r/2 planar subgraphs. 


6.3.27. Consider the copies of K,,., in a drawing of Km+in- 


6.3.28. (also next problem). Consider what happens when a vertex moves across an 
edge. For the second part, consider a drawing where the crossings are easy to count. 


7.1.11. For necessity in part (b), show first that the average vertex degree must be 2. 


7.1.16. Letting G = L(A), use H to show i S ¢ V(G) that the number of components 
of G — S is at most |5|+ 1. 


7.1.17. Extract an embedding of G from an F anibedding of L(G). 
7.1.20. Consider Definition 1.4.20. 


7.1.24. It may help to consider first the s 
proved separately. 
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7.1.26. Every edge incident to the cut-vertex must appear in some color. 


7.1.33. Show that an improvement can be made when one color appears too often and 
another not enough. 


7.2.17. Reduce the problem to the case where the two Hamiltonian graphs are cycles. 
7.2.23. For each k, determine how large'S must be so that G — S has k components. 


7.2.29. Write the given condition involving the degrees of G and G solely in terms of 
the degree of G; then show that G satisfies Chvatal’s condition for spanning paths. 


7.2.31. Use the Chvatal—_Erdés Theorem 


7.2.32. Be careful about how the transformation modifies the degrees; the condition is 
stated correctly. (Theorem 7.2.19). 


7.3.3. The dual graph is 3-regular. 

7.3.4. Consider the faces inside and outside a spanning cycle separately. 

7.3.5. Translate this problem into a statement proved earlier. 

7.3.17. Reduce this to studying the graph analyzed in Example 7.3.6. 

7.3.18, Modify the graph to obtain a situation where Grinberg’s Theorem applies. 


7.3.21. Given the triple of colors on {x;_1x;, y;-1y;, Zi-12;} consider the possibilities for 
the triple of colors on {x;xj41, Vi¥ie1, 2241}. 
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Glossary of Terms 


In addition to terms used in this book, this glossary also contains other 
related terms that the reader may encounter in further study. This includes 
some alternative terminology used by other authors. 

The items are informal sketches of definitions. Numbers in brackets are 
page references for the full definition or the first usage. When used without 
specification, “G” indicatés a graph or possibly also a digraph, “D” indicates a 
digraph, v or e indicate a vertex or edge, and n indicates the number of vertices. 


Absorption property (matroids) [351]: r(X) = XU e) =r(X U f) implies r(X) = f(X U f Ue) 

Acyclic [67]: without cycles 

Acyclic orientation [203, 208]: orientation without cycles 

Adjacency matrix A [6]: entry a;,; is number of edges from vertex i to vertex j 

Adjacency relation: set of unordered or ordered pairs forming edges in graph or digraph 

Adjacency set N(v): the set of vertices adjacent to v 

Adjacent [2]: vertices that are endpoints of an, n, edge, sometimes used to describe edges with a 
common endpoint 

Adjoins: is adjacent to 

Adjugate: matrix of cofactors 

Almost always [430]: having asymptotic probability 1 

M-alternating path: a path alternating between-edges i in M and not in M 

Ancestor [100]: in a rooted tree, a vertex along the path to the root 

Antichain: family of pairwise incomparable items (under an order relation) 

Anticlique: stable set 2 

Antihole: induced subgraph isomorphic to the complement of a cycle 

Approximation algorithm [496]: polynomial-time algorithm with bounded performance ratio 

Approximation scheme [496]: family of approximation algorithms with arbitrarily good perfor- 
mance ratio 

Arborescence: a directed forest in which every vertex has outdegree at most one 

Arboricity Y(G) [372]: minimum number of forests covering the edges 

Are: directed edge (ordered pair of vertices) 

k-arc-connected: same as k-edge-connected for digraphs 

Articulation point: a vertex whose deletion increases the number of components 

Assignment Problem [126]: minimize (or maximize) the sum of the edge weights in a perfect 
matching of a complete bipartite graph with equal part-sizes 

Asteroidal triple [346]: three distinct vertices with each pair connected by a path avoiding the. 
neighborhood of the third 
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Asymmetric: having no automorphisms other than the identity 

Asymptotic [431]: having ratio approaching 1 

Augmentation property (matroids) [352]: /,, /2 € Iwith |/2.| > |/,| implies the existence ofe € lg—/; 
such that /; Ve el 

Augmenting path [109]: for a matching, an alternating path that can be used to increase the size 
of the matching; for a flow, increases the flow value 

Automorphism [14]: a permutation of the vertices that preserves the adjacency relation 

Automorphism group T: the group of automorphisms under composition 

Average degree: 5° d(u)/n(G) = 2e(G)/n(G). 

Azuma’s Inequality: a bound on the probability in the tail of a distribution 


Backtracking [156]: depth-first-search 

Balanced graph [434]: the full graph is the subgraph with the largest average vertex degree 

Balanced k-partite: partite sets differ by at most one in size (see equipartite) 

Bandwidth: the minimum, over vertex numberings by distinct integers, of the maximum difference 
between labels of adjacent vertices 

Barycenter [78]: vertex minimizing the sum of distances to other vertices 

Base (matroids) [3849]: maximal independent set 

Base exchange property (matroids) [351]: for all Bj, By € Bande € B; — Bo,-there exists an element 
f € Bz — B, such that B, —e+ f is a base. 

Berge graph [340]: a graph with no odd hole or odd antihole 

Best possible: fails to be true when some condition is loosened 

Bicentral tree: a tree whose center is an edge 

Biclique [9]: complete bipartite graph 

Biconnected: 2-connected 

Bigraphic [65, 185]: a pair of sequences realizable as the vertex degrees for the partite sets in a 
simple bipartite graph 

X, Y-bigraph [24]: a bipartite graph with Bia ition X,Y 

Binary matrix (or vector): all entries in {0, 1} B 

Binary matroid [357]: representable over the field with two elements 

Binary tree [101]: rooted tree in which every non-leaf vertex has at most two children 

Binomial coefficient [487] Gi: the number of ways to choose a subset of size k from an n-element 
set, equal to n!/[k!(n — k)!]. 

Biparticity [422]: number of bipartite subgraphs needed to partition the edges 

Bipartite graph [4]: a graph whose vertices can be covered by two independent sets 

Bipartite Ramsey number: for a bipartite G, the minimum nv such that 2-coloring the edges of Ky,» 
forces a monochromatic G 

Bipartition [24]: a partition of the vertex set into two independent sets 

Birkhoff diamond [259]: a particular reducible configuration for the Four Color Problem 

Block [155]: (1) a maximal subgraph with no cut-vertex; (2) a graph with no cut-vertex; (3) a class 
in a partition of a set 

Block-cutpoint graph 156: simple bipartite graph in which the partite sets are the blocks and the 
cutvertices of G and the adjacency relation is containment 

Block graph: intersection graph of the blocks in G 

Blossom [142]: an odd cycle arising in Edmonds’ algorithm for general matching 

Bond [154]: a minimal edge cut 

Bond matroid [362]: dual of the cycle matroid of a graph 

Bond space [452]: orthogonal complement to the cycle space; linear combinations of bonds (over 
field of two elements) 

Book embedding: a decomposition of G into outerplanar graphs with a consistent ordering of the 
vertices (as on the spine of a book) 

Bougqiiet: a graph consisting of one vertex and some number of loops 

Branch vertex [249]: a vertex of degree at least 3 

Branching: a digraph where each vertex has indegree one except one that has indegree 0 

r-branching [404]: branching rooted at r 

Breadth-first search [99]: a search exploring vertices in order by distance from root 

Breadth-first tree: tree generated by a breadth-first search from a root 
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Bridge [304]: cut-edge 

H-bridge of G: H-fragment (used by other authors) 

Bridgeless graph [304]: graph without cut-edges 

Brooks’ Theorem: x(G) < A(G) for connected graphs, except for cliques and odd cycles 


Cactus [160]: a connected graph in which every edge appears in at most one cycle 

(k, g)-cage [49]: a kregular graph of smallest order among those with girth g 

Capacity [176, 178]: a limit on flow (1) through an edge in a network; (2) across a cut 

Cartesian product G1; 0 Ge [193]: the graph with vertex set V(G1) x V(G2) and edges given by 
(uy, U2) <— (1, v2) if 1) wy = v1 and ug — ve in Gg or 2) ug = ve and uy © v; in G, 

Caterpillar [88]: a tree with a single path containing at least one endpoint of every edge 

Cayley’s Formula [81]: statement there are n"~? trees with vertex set [n] 

2-cell [268]: on a surface, a region homeomorphic to a disc, meaning that every closed curve is 
contractible to a point 

2-cell embedding [268]: an embedding in which every region is a 2-cell 

Center [72]: subgraph induced by the vertices of minimum eccentricity 

Central tree [78]: a tree whose center is one vertex 

a, B-chain: a path alternating between colors a and 8 

Characteristic polynomial ¢(G; \) [453]: characteristic polynomial of the adjacency matrix of the 
graph, whose roots are the eigenvalues 

Children [100]: in a rooted tree, neighbors of the current vertex that are farther from the root 

Chinese Postman Problem [99]: problem of finding the cheapest closed walk covering all the edges 
in an edge-weighted graph 

Choice number [408]: choosability 

Choosability x;(G) [408]: minimum k such that G is k-choosable 

k-choosable [408]: for all lists of size k assigned to vertices of G, there is a proper coloring that 
selects a color for each vertex from its list _' 

Chord [225]: edge joining two nonconsecutive vertices of a path or cycle 

Chordal graph [225]: having no chordless cycle = 

Chordless cycle [225]: an induced cycle of length at least 4 

Chordiess path: a path that is an induced subgraph 

Chromatic index x'(G) [275]: edge-chromatic number 

Chromatic number x (G) [5, 191]: minimum number of colors in a proper coloring. 

Chromatic polynomial x(G; k) [220]: a polynomial whose value at k is the number of proper color- 
ings of G using colors from {1,..., k}. 

Chromatic recurrence: recurrence relation for chromatic polynomial 

k-chromatic [192]: having chromatic number k 

Circle graph [341]: an intersection graph of chords of a circle 

Circuit [27, 60]: equivalence class of closed trails without specifying starting vertex (an even graph); 
(caution—used by some authors to mean cycle) 

Circulant graph: a graph constructed as equally-spaced vertices on a circle with adjacency depend- 
ing only on distance 

Circular-arc graph [341]: an intersection graph of arcs of a circle 

Circulation [187]: a flow in a network with net flow 0 at each vertex 

Circumference [293]: the length of the longest cycle 

Clause [499]: a collection of literals in a lngical (Boolean) formula 

Claw [12]: the graph K 13 

Claw-free: having no induced K,,3 

Clique [4]: set of pairwise-adjacent vertices (used by many authors to mean complete graph) 

Clique cover [226]: a set of cliques covering the vertices (minimum size = 0(G)) 

Clique decomposition: a partition of the edge set into complete subgraphs 

Clique edge cover: a set of complete subgraphs covering the edges 

Clique identification: a perfection-preserving operation that merges cliques in two graphs 

Clique number w(G): maximum order of a clique in G 

Clique partition number: minimum size of a clique decomposition 

Clique tree [327]: an intersection representation of a chordal graph, consisting of a host tree with 
a bijection between its vertices and the maximal cliques of G such that the cliques containing 
each vertex form a subtree of the host 
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Clique-vertex incidence matrix [328]: 0,1-matrix in which entry (i, j) is 1 if and only if vertex j 
belongs to maximal clique i 

Closed ear [164]: a path between two (possibly equal) old vertices through new vertices 

Closed-ear decomposition [164]: construction of a graph from a cycle by addition of closed ears 

Closed neigborhood [116]: a vertex and all its neighbors 

Closed set (matroids) [360]: a set whose span is itself 

Closed walk [20]: a walk whose last vertex is the same as its first 

Closure [289, 360]: (1) the graph C(G) obtained from G by iteratively adding edges joining nonad- 
jacent vertices with degree-sum at least n(G); (2) image under a closure operator 

Closure operator [360]: an operator that is expansive, order-preserving, and idempotent 

Cobase [360]: a base of the dual matroid 

Cocircuit [360]: a circuit of the dual matroid 

Cocritical pair: two nonadjacent vertices whose addition as an edge increases the clique number 

Cocycle matroid [362]: the dual of a cycle matroid 

Cocycle space: bond space 

Cograph [202]: P4-free graph (equivalent to complement reducible graph) 

Color class [191]: in a coloring, a set of objects having the same color 

Color-critical [192]: a graph such that every proper subgraph has smaller chromatic number 

k-colorable [191]: having a proper coloring with at most k colors 

k-coloring [191, 380]: a partition into k sets 

P coloring: a vertex partition into subsets inducing graphs with property P 

Column matroid M(A) [351]: matroid whose independent sets are the linearly independent subsets 
of columns of the matrix A 

Comma-free code: no code word is a prefix of another 

Common system of distinct representatives (CSDR) [171]: given families A and B of sets, a CSDR 
is a set of elements that is an SDR of A and i is an SDR of B 

Comparability graph [228]: graph having a transitive orientation 

Complement G[3]: simple graph or digraph with the same vertex set as G, defined by uv € E(G) if 
and only if uv ¢ E(G) 

Complement reducible [344]: reducible to the trivial graph by iteratively taking complements of 
components 

Complete graph K,, [9]: simple graph in which eich two vertices are adjacent 

Complete k-partite graph Kn, .....n, [207]: k-partite graph in which every pair of vertices not belong- 
ing to the same partite set is adjacent (sizes of the partite sets are n1,..., nx) 

Completely labeled cell [388]: simplicial region with distinct labels on corners 

Complexity [494]: the worst-case number of operations needed, as a function of the input size 

Component [22]: maximal connected subgraph 

S-component of G: see S-lobe 

Composition G,[G2] [3832]: a graph whose vertex set is the cartesian product V(G1) x V(Go), 
defined by (ui, u2) <> (v1, ve) if and only if u, < v, in Gj, or uy = v1 and ug © v2 in G2 

Conflict graph [252]: graph whose vertices are the bridges of a cycle, with bridges adjacent (con- ; 
flicting) when they have three common endpoints or four alternating endpoints on the cycle / 

Conflicting chords: two chords whose endpoints alternate on a specified cycle / 

Conjugate partition: two partitions of n such that one gives the row sizes and the other the column 
sizes of a Ferrers diagram 

Connected [6]: having a u, v-path for every pair of vertices u, v 

k-connected [149, 164]: having connectivity at least k 

Connection relation [21]: relation satisfied by vertices x, y if there is an x, y-path 

Connectivity «(G) [149, 164]: the minimum number of vertices whose deletion disconnects the 
graph or reduces it to one vertex (sometimes called “vertex connectivity” for clarity) 

Consecutive 1s property (for rows) [328]: having a permutation of columns so 1s appear consecu- 
tively in each row 

Conservation constraint [176]: for a flow, the condition of net flow 0 at a vertex 

Consistent rounding [186]: conversion of the data and the row/column sums in a matrix to nearest 
integers up or down such that row and column sums remain correct 

Construction procedure: a procedure for iteratively building members of a class of graphs from a 
small base graph or graphs 

Contraction [84]: replaces edge uv by a vertex w incident to the edges formerly incident to u or v 
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Converse D~': obtained from digraph D by switching the head and tail in each edge 

Convex embedding [248]: a plane graph in which every bounded face is a convex set and the outer 
boundary is a convex polygon 

Convex function [443]: satisfies the inequality f(Aa + (1 — A)b) < Af(a) + (1—A) f(b) for all a,b 
and0Q<A<1 

Convex quadrilateral: not no corner in the triangle formed by the other three 

Cost [125]: name of the objective function for many weighted minimization problems 

Cotree: with respect to a graph, the edges not belonging to a given spanning tree 

F-covering: covering of edge set by subgraphs in the family F 

Critical edge [122, 339]: edge whose deletion increases the independence number 

Critical graph: used with respect to many graph properties, indicating that the deletion of any 
vertex (or edge, depending on context) destroys the property 

k-critical graph [192]: usually means color-critical with chromatic number k 

Critically 2-connected: deletion of an edge destroys 2-connectedness 

Crossing [234]: in a drawing of a graph, an internal intersection of two edges 

Crossing number v(G) [262]: minimum number of crossings when drawing G in the plane 

k-cube Q, [36]: k-dimensional cube 

Cubic graph [304]: a regular graph of degree 3 

Cut [S, S] [166]: the edges from a vertex subset to its complement (especially in networks) 

Cut-edge [23]: an edge whose deletion increases the number of components 

Cutset: a separating set of vertices 

Cut-vertex [23]: vertex whose deletion increases the number of components 

Cycle [5, 55]: a simple graph whose vertices can be placed on a circlé so that vertices are adjacent 
if and only if they appear consecutively on me circle (caution—used by some authors to mean 
even graph) 

Cycle double cover [312]: a list of cycles such that each edge appears in two items in the list 

k-cycle [9]: a cycle of length k, consisting of k vertices and k edges 

Cycle matroid M(G) [350]: the matroid whose circuits are the cycles of G 

Cycle rank: dimension of cycle space, equal to #edges — #vertices + #components 

Cycle space [452]: the nullspace of the incidence matrix; the elements correspond to the even 
subgraphs Bb 

Cyclic edge-connectivity: number of edges that must be deleted to disconnect a component so that 
every remaining component contains a cycle 

Cyclically k-edge-connected: cyclic edge-connectivity at least k 


de Bruijn graph [61]: digraph encoding possible transitions between k-ary n-tuples as additional 
characters are received 

Decision problem [494]: a computational problem with a YES/NO answer 

Decomposition [11]: an expression of G as a union of edge-disjoint subgraphs 

F-decomposition [397]: decomposition using graphs in the family F 

F-decomposition number of G: minimum number of graphs in an F-decomposition of G 

Degree d(v) [6, 34]: (1) for a vertex, the number of times it appears in edges (may be modified by 
“in-” or “out-” in a digraph); (2) for a regular graph, the degree of each vertex 

Degree sequence d, > .-: > d, [44]: the list of vertex degrees, usually indexed in nonincreasing 
order regardless of vertex order 

Degree set: the set of vertex degrees (appearing once each) 

Degree-sum Formula: )) d(v) = 2e(G) 

Deletion method [428] a strengthening of the existence argument in the probabilistic method 

Demand [184]: sink constraint in transportation network 

Density [435]: ratio of number of edges to number of vertices 

Dependent edge [231]: an edge in an acyclic orientation whose reversal creates a cycle 

Dependent set (matroids) [349]: a set containing a circuit 

Depth-first search [156]: backtracking search from a vertex, exploring from the most recently 
reached vertex and backing up when it has no new neighbors 

Descendants of x [100]: in a rooted tree, members of the subtree rooted at x 

Diagonal Ramsey number [385]: Ramsey number for an instance where the thresholds (numbers 
or graphs) are equal 
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Diameter [70]: the maximum of the distance d(u, v) over vertex pairs u, v 

Digraph [53]: directed graph 

Dijkstra’s Algorithm [97]: algorithm to compute shortest paths from one vertex 

Dilworth’s Theorem [413]: maximum number of pairwise incomparable elements equals minimum 
number of totally ordered subsets needed to cover all elements 

k-dimensional cube Q, [36]: simple graph with vertex set {0, 1} where vertices are adjacent if and 
only if their names differ in exactly one coordinate 

Dinitz Conjecture [410]: each bipartite graph G is A(G)-list-edge-colorable 

Directed graph [53]: vertex set, edge set, and specification of head and tail for each edge 

Directed walk, trail, path, cycle, etc. [57]: same as without the adjective “directed” (the head of an 
edge is the tail of the next edge) 

Disc: a planar region bounded by a simple closed curve 

Disconnected [6]: a graph with more than one component 

Disconnecting set [152]: a set of edges whose deletion makes some vertex unreachable from some 
other vertex 

Disjoint union G; + G2 [39]: the union of two graphs with disjoint vertex sets 

Disjointness graph: complement of intersection graph 

Distance d(u, v) [70]: the minimum length of a u, v-path 

Distance-preserving embedding [400]: mapping f: V(G) — V(H)sothatdy(f(u), f(v)) = dg(u, v). 

Dodecahedron [243]: planar graph with 20 vertices, 30 edges, and 12 faces of length 5 

Dominating set [116]: a set 5 C V such that every vertex outside S has a neighbor in S$ 

Domination number [116]: the minimum size of a dominating set of vertices 

Double jump [437]: the markedly different structure of the random graph in Model A for probability 
functions of the form c/n withc < 1,c=1,andc > 1. 

Double star [77]: a tree with at most two vertices of degree more than 1 

Double torus [266]: the (orientable) surface a handles 

Double triangle [280]: K4 —e 

Doubly stochastic matrix [120]: square matrix Kaving sum 1 in each row and column 

Dual augmentation property (matroids) [362]: disjoint sets independent in a matroid and its dual 
can be ‘enlarged to a complementary base and cobase 

Dual edge e* [236]: the edge of the dual graph G* corresponding to edge e of a plane graph G 

Dual graph G* [236]: for a plane graph G, the graph with a vertex for each region of G, where 
vertices are adjacent if the boundaries of their regions in G share an edge (extends to 2-cell 
embeddings on any surface) 

Dual hereditary system (or matroid) M [360]: the hereditary system whose bases are the comple- 
ments of the bases of M 

Dual problem [113]: for a problem maxc?x such that Ax < b and x > 0, the dual is min y7b such 
that yA > cand y> 0 

Duality gap: strict inequality between optimal values of a pair of dual integer programs 

Duplication of vertex x [3821]: adding x’ with N(x’) = N(x) 


Ear [163]: path whose internal vertices have degree two (or are “new”) 

Ear decomposition [163]: construction of G from a cycle by addition of ears 

Eccentricity eg(v) [70]: for a vertex, the maximum distance to other vertices 

Edge [2]: (1) in a graph, a pair of vertices (E(G) denotes the edge set); (2) in a hypergraph, a subset 
of the vertex set 

Edge-choosability x;(G) [409]: minimum k such that G is k-edge-choosable 

k-edge-choosable [409]: for all lists of size k assigned to edges of G, there exists a proper edge- 
coloring that selects a color for each edge from its list 

Edge-chromatic number x’(G) [275]: the minimum number of colors in a proper edge-coloring 

k-edge-colorable [275]: having a proper edge-coloring with at most k colors 

Edge-coloring [274]: an assignment of labels to the edges 

k-edge-connected [152, 164]: having edge-connectivity at least k 

Edge-connectivity «’(G) [152]: the minimum number of edges whose deletion disconnects G 

Edge cover [114]: a set of edges incident to all the vertices 

Edge cut [S, S] [152, 164]: the set of edges joining a vertex in S to a vertex not in S 

Edge-reconstructible: a graph that can be determined (up to isomorphism) by knowing the multiset 
of subgraphs obtained by deleting single edges 
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Edge-Reconstruction Conjecture: the conjecture that every graph with at least four edges is edge- 
reconstructible 

Edge-transitive [18]: having for each pair e, f € E(G) a permutation that maps e to f 

Eigenvalue [453]: for a graph, an eigenvalue of the adjacency matrix 

Eigenvector of A [453]: a vector x such that A, = Ax for some constant A 

Elementary contraction [84]: contraction 

Elementary cycle: boundary of a region in a plane graph (caution - some authors who use “cycle” 
to mean circuit use “elementary cycle” to mean cycle) 

Elementary subdivision [162]: replacement of an edge by a path of two edges connecting the end- 
points of the original edge (see edge subdivision) 

Embedding [234]: a mapping of a graph into a surface, such that (the images of) its edges do not 
intersect except for shared endpoints 

Empty graph [22]: graph having no edges 

Endpoint [2]: (1) each member of an edge; (2) the first or last vertex of a path, trail, or walk 

End-vertex: a vertex of degree 1 

Equipartite [207]: having partite sets differing in size by at most 1 

Equitable coloring: having color classes differing in size by at most 1 

Equivalence [399]: as a graph, a union of pairwise disjoint complete graphs 

Equivalence relation [490]: reflexive, symmetric, and transitive relation 

Erdés number: distance from Erdés in the collaboration graph of mathematicians 

Euler characteristic: for a surface of genus y, 2 — 2y 

Euler tour: Eulerian circuit 

Eulerian circuit [26, 60]: a closed trail containing every edge 

Eulerian (di)graph [26, 60]: a graph or digraph having an Eulerian circuit 

Eulerian trail [26, 60]: a trail containing every edge 

Euler’s Formula [241]: the formula n —e+ f = 2>-2y for 2-cell embeddings of a connected n-vertex 
graph with e edges and f faces on a surface of genus y 

Even cycle [24]: cycle with an even number of edges (or vertices) 

Even graph [26]: graph with all vertex degrees even 

Even pair [348]: vertex pair x, y such that every-chordless x, y-path has even length 

Even triangle [280]: triangle T such that every vertex has an even number of neighbors in T 

Even vertex [26]: vertex of even degree 3) 

Evolution: the model of generating random graphs by successively adding random edges 

(n, k,c)-expander [463]: bipartite graph with partite sets of size n and vertex degrees at most 
k such that each set S with at most half the vertices of the first partite set has at least 
(1+ c(1 — |S| /n) |S| neighbors 

Expansion: in 3-regular graph, subdivides two edges and adds one edge joining the new vertices 

Expansion Lemma [162]: adding a vertex of degree k to a k-connected graph preserves k- 
connectedness 

Expansive property [358]: for a function o on the subsets of a set, the requirement that X C 0(X) 
for all X 

Expectation [427]: for a discrete random variable, ) | kProb (X = k) 

Exterior region: the unbounded region in a plane graph 

Exterior vertex: vertex on the unbounded region 


Face 235]: a region of an embedding 

Factor [136]: a spanning subgraph 

f-factor [140]: a spanning subgraph with d(v) = f(v) 

k-factor [140]: a spanning k-regular subgraph 

k-factorable [276]: having a decomposition into k-factors 

Factorization: an expression of G as the edge-disjoint union of spanning subgraphs 

k-factorization [276]: a decomposition of a graph into k-factors 

x, U-fan [170]: pairwise internally-disjoint paths from x to distinct vertices of U 

Fary’s Theorem [246]: a planar graph has a straight-line embedding in the plane 

Fat triangle [275]: a 3-vertex graph in which each pair has the same edge multiplicity 

Feasible flow [176]: a network flow satisfying edge-constraints and having net ‘flow 0 at each 
internal vertex 


522 Appendix D: Glossary of Terms 


Feasible solution [322]: a choice of values for the variables that satisfies all the constraints in an 
optimization problem 

Ferrers digraph: a digraph (loops allowed) with no x, y, z, w (not necessarily distinct) such that 
x > yandz > w butz # y and x # w; equivalently, the successor sets or predecessor 
sets are ordered by inclusion; equivalently, the adjacency matrix has no 2-by-2 permutation 
submatrix. 

Five Color Theorem [257]: the theorem that planar graphs are 5-colorable 

Flat [266]: a closed set in a matroid 

Flow [176]: an assignment of weights to edges of a network 

k-flow [307]: an assignment of weights in {—k + 1,...,k — 1} to edges of a digraph so that net flow 
out is zero at each vertex 

Flower (in Edmonds’ Blossom Algorithm) [142]: consists of a stem (alternating path from an un- 
saturated vertex) and a blossom (odd cycle with a nearly-perfect matching) 

Forcibly Hamiltonian: a degree sequence such that every simple graph with that degree sequence 
is Hamiltonian 

Forest [67]: a disjoint union of trees, an acyclic graph 

Four Color Theorem [260]: the theorem that planar graphs are 4-colurable 

Fraternal orientation [345]: an orientation such that two vertices are adjacent if they have a 
common successor 

H-fragment of G [252]: a component of G — H together with the edges to its vertices of attachment 

H-free [41]: having no copy of H as an induced subgraph 

Free matroid [357]: matroid in which every set of elements is independent 

Friendship Theorem [467]: if every pair of people in a set have exactly one common friend in the 
set, then someone is everyone’s friend 

Fundamental cycle [374]: for a spanning tree, a cycle formed by adding an edge to it 
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Gammoid [377]: a matroid ou F arising from veux sets F,E ina digraph by letting independent 
sets be those that are saturated by a set of ‘disjoint paths starting in F 

Generalized chromatic number: minimum number of classes needed to partition the vertices so 
that the subgraph induced by each color class has property P 

Generalized Petersen graph [316]: the graph with vertices {u1,..., un} and {v,,..., vz} and edges 
{ujujoi}, {ujui}, and {v; vj+4}, where addition i is modulo n 

Generalized Ramsey number r(G,..., Gx) [386]: the minimum n such that k-coloring the ede of 
K,, forces a copy of G; in color i for some i 

Genus y [266]: (1) for a surface, the number of handles in its topological des~“iption (2) for a graph, 
the minimum genus surface on which it embeds 

Geodesic: a shortest path between its endpoints 

Geodetic: having the property that each pair of vertices u, v are the endpoints of a unique path of 
length d(u, v) 

Girth [13]: the length of a shortest cycle in G 

k-gon: in an embedding, a k-cycle bounding a region 

Good algorithm [124]: algorithm running in polynomial time 

Good characterization [495]: a characterization that is checkable in polynomial time 

Good coloring: often means proper coloring 

Gossip problem [406]: minimize the number of calls so that each vertex transmits to every other 
by an increasing path 

Graceful labeling [87]: an assignment of distinct integers to vertices such that 1) the integers are 
between 0 and e(G), and 2) the differences between the labels at the endpoints of the edges 
yield the integers 1,..., e(G) 

Graceful graph [87]: a graph with a graceful labeling 

Graceful tree [87]: a tree with a graceful labeling 

Graceful tree conjecture [87]: every tree has a graceful labeling 

Graph [2]: a. set of vertices, a set of edges, and an assignment of a set at most two vertices as 
endpoints of each edge 

Graphic matroid M(G) [350]: matroid whose independent sets are the acyclic subsets of E(G) 

Graphic sequence [44]: a list of integers realizable as the degree sequence of a simple graph 

Greedy algorithm [95, 354]: a fast algorithm to find. a good feasible solution by iteratively making 
a heuristically good choice 
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Greedy coloring [194]: with respect to some vertex ordering, color each vertex with the least-indexed 
color not already appearing among the neighbors of the vertex being colored 

Grinberg condition [303]: necessary for Hamiltonian cycles in planar graphs, that summing 
(length—2) over the inside faces or over the outside faces yields the same total 

Grétzsch graph [205]: the smallest triangle-free 4-chromatic graph 

Grundy number: the maximum number of colors in an application of the greedy coloring algorithm 


Hadwiger conjecture [213]: every k-chromatic graph has a subgraph contractible to K; (true for 
“almost all” graphs) 

Hajés conjecture [213]: every k-chromatic graph contains a K;,-subdivision (false for k > 5) 

Hall’s condition [110]: for every subset S of a partite set X in a bipartite graph, at least |5| vertices 
have neighbors in S ; 

Hall’s theorem [110]: Hall’s condition is necessary and sufficient for the existence of a matching 
that saturates X 

Hamilton tour: Hamiltonian cycle 

Hamiltonian [286]: having a Hamiltonian cycle 

Hamiltonian closure [289]: graph obtained by successively adding edges joining vertices whose 
degree-sum is as large as the number of vertices 

Hamiltonian-connected [297]: having a Hamiltonian path from each vertex to every other 

Hamiltonian cycle [286]: a cycle containing each vertex 

Hamiltonian path [291]: a path containing each vertex 

Harary graphs [150]: a family of k-connected n-vertex graphs with the fewest edges 

Head [53]: the second vertex of an edge in a digraph 

Heawood’s Formula [268]: the chromatic number of a graph embedded on the oriented surface with 


y handles is at most |1/2(7 + ,/1 + 48y)]< 
Helly property [80]: the property of the real line (or trees) that pairwise intersecting subsets have 
a common intersection point 
Hereditary class [226]: a class F such that all induced subgraphs of graphs in F are also in F 
Hereditary family [349]: a family F of sets such that every subset of a member of F is in F 
Hereditary system [349]: a system consisting of a hereditary family and the alternative ways of 
specifying that family Bp 
Hole [340]: a chordless cycle in a graph ea) 
Homeomorphic: two graphs obtainable from the same graph by subdivision of edges 
Homogeneous [380]: in Ramsey theory, a set whose colored pieces have the same color 
Homomorphism: a map f : V(G) + V(#) that preserves adjacency 
Huffman code [103]: prefix-free encoding of data to minimize expected search time 
Hungarian Algorithm [126]: an algorithm for solving the assignment problem 
Hypercube Q, [36]: k-dimensional cube 
Hypergraph [449]: a generalization of graph in which edges may be any subset ofthe vertices 
Hyperplane (matroids) [360]: a maximal closed proper subset of the ground set 
Hypohamiltonian: a non-Hamiltonian graph whose vertex-deleted subgraphs are all Hamiltonian 
Hypotraceable: a non-traceable graph whose vertex-deleted subgraphs are all traceable 


Icosahedron [243]: planar triangulation with 12 faces, 30 edges, and 20 vertices 

Idempotence property (matroids) [359]: o2(X) = o(X) for all X 

Identification: an operation replacing two vertices by a single vertex with the combined incidences 
(same as contraction if the vertices are adjacent) 

Imperfect graph [232]: has x(H) > w(H) for some induced subgraph H 

Incidence matrix [6]: (1) for a graph, the 0,1-matrix in which entry (i, /) is 1 if and only if vertex i 
and edge j are incident; (2) for a digraph, entry (i, j) is 1 if vertex i is the head of edge j, —1 
if it is.the tail, 0 otherwise; (2) in general, the matrix of a membership relation 

Incident [6]: 1) a vertex v and edge e with v € e; 2) two edges with a common endpoint 

Inclusion-exclusion principle [223]: number of objects outside A1,..., An is)’, eta) (7D)! Nie 5 Ai | 

Incomparability graph: the complement of a comparability graph 

Incorporation property (matroids) [359]: r(o(X)) = r(X) 

Indegree [58]: for a vertex in a directed graph, the number of edges of which it is the head 

Independence number a(G) [113]: maximum size of an independent set of vertices 
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Independent domination number [117]: minimum size of an independent dominating set 

Independent set [3]: a set of pairwise nonadjacent vertices 

Indicator variable [427]: a random variable taking values in {0, 1} 

Induced circuit property (matroids) [355]: adding an element to an independent set creates at most 
one circuit 

Induced sub(di)graph G[A] [23]: the sub(di)graph on vertex set A C V(G) obtained by taking A 
and all edges of G having both endpoints in A 

Integer program [323]: linear program plus requirement that variables be integer-valued 

Integrality Theorem [181]: in a network with integer edge capacities, there is an optimal flow 
expressible as units of flow along source/sink paths 

Interlacing Theorem [458]: for each vertex x, the eigenvalues {A;} of G and {y;} of G — x satisfy 
Ay > ey > AQ 2s > Un = An 

Internal vertices [20]: (1) for a path, the non-endpoints; (2) for a plane graph, the vertices not on 
the boundary of the exterior face 

Internally disjoint paths [161]: paths intersecting only at endpoints 

Intersection graph [324]: for a family of sets, the graph having a vertex for each set and having 
vertices adjacent when the sets intersect 

Intersection number [397]: minimum size of aset U such that G is an intersection graph of subsets 
of U (equals minimum number of complete subgraphs covering E(G)) 

Intersection of matroids [366]: the hereditary system whose independent sets are the common 
independent sets in the matroids , 

Intersection representation [324]: an assignment of a set S, to each vertex v such that u @ vif and 
only if S, Sy #@ 

Interval graph [195]: a graph having an interval representation 

Interval number [451]: minimum ¢ such that G has a t-interval representation 

Interval representation of G [195]: 2 collection of intervals whose intersection graph is G 

t-interval [451]: a union of at most t intervals inJR 

t-interval representation [451]: an intersection representation where each assigned set is a [t- 
interval 2 

In-tree [89]: a directed tree in which each edge is oriented toward the root 

Involution [470]: a permutation whose square is:the identity 

Isolated vertex or edge [22]: incident to no (other) edge 

Isometric embedding [400]: a distance-preserving mapping of V(G) into V(H) 

Isomorphic decomposition: decomposition into isomorphic subgraphs 

Isomorphism [7]: a vertex bijection preserving the adjacency relation 

Isthmus: a cut-edge 

Join G V H [138]: the disjoint union G + H plus the edges {uv : u € V(G), v € V(A)} 

Joined to: adjacent to 

Junction: vertex of degree at least three 
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Kempe chain [258]: a path between two vertices that alternates between two colors (particularly 
as used in forbidding minimal 5-chromatic planar graphs) 

Kernel [57, 410]: in a digraph, an independent in-dominating set 

Kernel perfect [410]: having a kernel in each induced subgraph 

Kirchhoff’s current law: net flow around a closed walk is 0 

Kite [12]: simple 4-vertex graph obtained by deleting one edge from K4 

Kénig-Egervary Theorem [112]: maximum matching and minimum vertex in a bipartite graph 
have equal size 

K6énig’s Other Theorem [115]: maximum independent and minimum edge cover in a bipartite graph 
with no isolated vertices have equal size 

Krausz decomposition [285]: edge covering by complete subgraphs using each vertex at most twice 
(leads to the graph for which this is the line graph) 

Kronecker product: tensor product 

Kruskal’s algorithm [95]: grows a minimum weighted spanning tree by iteratively adding the 
cheapest edge in the graph that does not complete a cycle 

Kuratowski subgraph [247]: subdivision of Ks or K3.3 

Kuratowski’s Theorem [246]: a graph is planar if and only if it has no subdivision of Ks or K3,3 
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Labeling: assignment of integers to vertices 

Leaf [67]: vertex of degree 1 

Leaf block [156]: a block containing only one cut-vertex 

Length [20]: the number of steps (or sum of weights) from start to finish 

‘Lexicographic product G[H] [393]: composition 

Line: another name for edge 

Line graph L(G) [168, 273]: the intersection graph of the edges of G, where vertices correspond to 
edges of G and are adjacent if the corresponding edges share a vertex 

Linear matroid [351]: matroid whose independent sets are the sets of independent columns of some 
matrix over some field 

Linear program [179]: problem of optimizing a linear function with linear constraints 

Link: edge 

k-linked: a stronger condition than k-connected, in which for every choice of two k-tuples of ver- 
tices (u1,..., uz) and (v1,..., vy), there exists a set of k internally disjoint paths connecting 
corresponding vertices u;, v;. 

List chromatic index [409]: edge-choosability 

List chromatic number [408]: choosability 

List Coloring Conjecture [409]: edge-choosability always equals edge-chromatic number 

Literal [500]: a logical (true/false) variable or its negation 

S-lobe [211]: a subgraph of G induced by S U V;, where V; is the vertex set of a component of G — S$ 

Local search: technique for solving optimization problems by successively making small changes 
in a feasible solution 

Loop [2]: an edge whose endpoints are the same 

Loopless [6]: having no loops 


ks 


(n, k, c)-magnifier [463]: n-vertex graph of maximum degree k in which gach set S with at most half 
the vertices has at least c|5| neighbors outside S 

Markov chain [54]: discrete system with transition probabilities 

Markov’s inequality [432]: for a nonnegative random variable, Prob (X >t) < E(X)/t 

Martingale [443]: sequence of random variables such that E(X;|Xo,..., Xi-1) = Xi-1 

Matching [107]: a set of edges sharing no endpoints 

b-matching: given a constraint vector b, a subgraph H with dy(v) < b(v) for all v 

Matrix rounding [186]: problem of converting the data and row/column sums in a matrix to nearest 
integers up or down such that row and column sums remain correct 

Matrix-Tree Theorem [86]: subtracting the adjacency matrix from the diagonal matrix of degrees, 
deleting a row and column, and taking the determinant yields the number of spanning trees 

Matroid [354]: a hereditary system satisfying any one of a list of many equivalent properties 

Matroid basis graph [376]: graph whose vertex set is the collection of bases of a matroid, adjacent 
when their symmetric difference has two elements 

Matroid Covering Theorem [372]: the number of independent sets needed to cover the elements of 
a matroid is maxycg [|X|/r(X)] 

Matroid Intersection Theorem [367]: the maximum size of a common independent set in two ma- 
troids on E equals the minimum over X C E of the rank of X in the first matroid plus the 
rank of X in the second matroid 

Matroid Packing Theorem [372]: the maximum number of pairwise disjoint bases in a matroid is 
min,(x)<r(eE) L(|/E| — CA(X))/(r(E) — r(X))J 

Matroid Union Theorem [370]: the union of matroids Mj,..., M, is a matroid with rank function 
r(X) =minycx(|X — Y|+ pe ri(Y)) 

Max-flow Min-cut Theorem [180]: maximum flow value equals minimum cut value 

Maximal clique [31]: a maximal set of pairwise adjacent vertices 

Maximal path or trail [27]: non-extendible path or trail 

Maximal planar graph [242]: equivalent to planar triangulation 

Maximum Cardinality Search [325]: an algorithm for recognizing chordal graphs 

Maximum degree A [34]: maximum of the vertex degrees 

Maximum flow [176]: a feasible network flow of maximum value, or the value itself 

Maximum genus yy(G): the maximum genus surface on which G has a 2-cell embedding 

Maximum (P-object) [31]: for a property P, no larger object of the same type also has property P 
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Menger’s theorems [167-169]: min-max characterizations of connectivity by number of pairwise 
internally-disjoint or edge-disjoint paths between pairs of vertices 

Meyniel graph [330]: any graph in which every odd cycle of length at least 5 has at least two chords 

Minimal imperfect graph [320]: imperfect graph where every proper induced subgraph is perfect 

Minimally 2-connected [175]: deleting any edge destroys 2-connectedness 

Minimum cut [178]: a source/sink cut having minimum value, or the value of such a cut 

Minimum degree 5(G) [34]: minimum of the vertex degrees 

Minimum (P-object) [31]: for a property P, no smaller object of the same type also has property P 

Minimum Spanning Tree (MST) [95]: spanning tree with minimum sum of edge weights 

Minor [251, 362]: graph (or matroid) obtained by deletions and contractions 

Mixed graph: a graph model allowing directed and undirected edges 

Mobius ladder: the graph obtained by adding to an even cycle the chords between vertex pairs at 
maximum distance on the cycle (can be drawn as a ladder with a twist) 

Mobius strip: the non-orientable surface obtained by identifying two opposite sides of a rectangle 
using opposite orientation 

Model A [430]: probability distribution generating simple graphs with vertex set [n] by letting each 
pair be an edge with probability p(n), independently 

Model B [430]: probability distribution making the simple graphs with vertex set [n] and m edges 
equally likely 

rth-moment [433]: expectation of X” 

Monochromatic [886]: in a coloring, a set having all elements the same color 

Monotone graph property [432]: preserved under deletion of edges or vertices 

Multigraph: used by many authors to mean graphs that allow (but don’t require) multiple edges 
and loops (some authors forbid loops from multigraphs) 

Multinomial coefficient [489]: counts arrangements having fixed multiplicities of items; with k; 
items of type i, there are (| k;)!/[ [(ki!) ways to arrange them in a list 

Multiple edges [2]: edges with the same endpoints 
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Nearest-insertion [497]: TSP heuristic to grow accycle 

Nearest-neighbor [496]: TSP heuristic to grow a-path 

Neighborhood N(v) [34]: set of neighbors of v (closed neighborhood N[v] also includes v) 

Neighbors [2]: (noun) the vertices in the neighborhood; (verb) “is adjacent to” 

Net outflow [178]: at a vertex, the total exiting flow minus the total entering flow 

Network [176]: a directed graph with a distinguished initial vertex (source) and a distinguished 
‘terminal vertex (sink), in which each edge is assigned a flow capacity and possibly also a flow 

demand (lower bound) 

Node: vertex, especially in network flow problems 

Nondeterministic algorithm [494]: allowed to “guess” by having parallel computation paths 

Nondeterministic polynomial algorithm [494]: having a polynomial-time computation path for each 

guess of a polynomial number of bits 

Nonorientable surface: a surface with only one side 

Nontrivial graph [22]: having at least one edge 

Nonplanar [243]: having no embedding in the plane 

Nowhere-zero k-flow [207]: a kflow in which all assigned weights are nonzero 

NP [495]: the class of problems solvable by nondeterministic polynomial algorithms 

NP-complete [495]: NP-hard and in NP 

NP-hard [495]: provides a polynomial algorithm for every problem in NP 

Null graph [3]: graph having no vertices 

Numbering: a bijection from V(G) to [n(G)] 


Obstruction: forbidden substructure 

Odd antihole [340]: complement of an odd hole 

Odd component [136]: component with an odd number of vertices 
Odd cycle [24]: cycle with an odd number of edges (vertices) 

Odd graph: the disjointness graph of the k-subsets of [2k + 1] 
Odd hole: chordless odd cycle 

Odd vertex [27]: vertex of odd degree 
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Odd walk [24]: walk of odd length 

Open walk [20]: walk in which the first and last vertex are different 

Optimal tour: a solution to the traveling salesman problem or Chinese postman problem 

Order of graph [34]: the number of vertices 

Ordered ‘graph [406]: a graph with an order relation (usually linear) on the edges 

Order-preserving property [358]: for a function o on the set of subsets of a set, the requirement 
that X C Y implies 0(X) C o(Y) 

Orientable surface: a surface with two distinct sides 

Orientation of graph [62]: a digraph obtained by designating a head and tail for each edge 

Outdegree [58]: for a vertex, the number of edges of which it is the tail 

Outerplanar graph [239]: a planar graph embeddable in the plane so that all the vertices are on 
the boundary of the exterior region 

Outerplane graph [239]: a particular embedding of an outerplanar graph 


Parallel elements [351]: non-loops in a matroid that form a set of rank 1 

Parent [100]: the neighbor of a vertex along the path to the root in a rooted tree 

Parity [473]: odd or even 

Parity subgraph of G [312]: subgraph H such that dy(v) = dg(v) mod 2 for all v € V(G) 

k-partite [5]: same as k-colorable 

Partite set [4]: a set in a vertex partition into independent sets (color class) 

Partition matroid [357]: a matroid induced by a partition of the ground set in which a set is 
independent if and only if it has at most one element from each block of the partition 

Partitionable graph [335]: a graph with aw + 1 vertices where each vertex-deleted subgraph is 
colorable by w stable sets of size a and coverable by a cliques of size w 

Path [5]: a simple graph whose vertices can be listed so that vertices are adjacent if and only if 
they are consecutive in the list ray 

U4, v-path [20]: a path with u and v as endpoints = 

Path addition [163]: a step in an ear decomposition 

Path decomposition [414]: expression of a graph as a union of pairwise edge-disjoint paths 

Paw [12]: simple 4-vertex graph obtained by adding one edge to a claw 

p-critical graph [334]: an imperfect graph whose’proper induced subgraphs are all perfect 

Pendant edge [67]: edge incident with a vertex of degree 1 

Pendant vertex [67]: a vertex of degree 1 

a-perfect [319]: a(H) = 0(H) for evey induced subgraph H 

B-perfect: [335] a(H)w(H) > n(H) for every induced subgraph H 

y-perfect: [319] x(H) = w(H) for evey induced subgraph H 

Perfect elimination ofdering [224]: deletion order such that when each vertex is deleted, its neigh- 
borhood in what remains is a clique (same as simplicial elimination ordering) 

Perfect graph [226]: graph such that x(H) = w(H) for every induced subgraph H 

Perfect Graph Theorem (PGT) [226, 320]: a graph is perfect if and only if its complement is perfect 

Perfect order [331]: a vertex order yielding optimal greedy colorings for all subgraphs 

Perfectly orderable graph [331]: having a perfect order 

Perfect matching [107]: a set of edges such that each vertex belongs to exactly one of them 

Peripheral vertex [70]: a vertex of maximum eccentricity 

Permutation [486]: a bijection from a finite set to itself 

Permutation graph: representable.by a permutation o by v; < v; if and only ifo reverses the order 
of i and j 

Permutation matrix [120]: a 0,1-matrix having exactly one 1 in each row and column 

Petersen graph [12]: the disjointness graph of the 2-sets in a 5-element set 

Pigeonhole principle [491]: every set of numbers has one at least as large as the average 

Pigeonhole property [427]: a finite probability space has an element where the value of a random 
variable is at least as large as its expectation 

Planar graph [5, 235]: a graph embeddable in the plane 

Plane graph [235]: a particular planar embedding of a planar graph 

Plane tree [101]: tree with fixed cyclic embedding order of edges at each vertex 

Planted tree [101]: rooted plane tree 

Platonic solid [242]: bounded regular polyhedron 


528 Appendix D: Glossary of Terms 


Point: vertex 

Polygonal curve [234]: concatenation of segments 

Polyhedron [242]: an intersection of half-spaces 

Polytope: the convex hull of a set of vertices 

Positional game [120]: a game in which the objective is seizing the positions of a winning set 

kth-power (G*): the graph with vertex set V(G) in which u ~ v if and only if dg(u, v) <k 

Predecessor [54]: for v in a digraph, a vertex u with u > v 

Predecessor set [58]: for v in a digraph, the set of precedessors 

Prefix-free code [101]: no code word is a prefix of another 

Prim’s Algorithm [104]: grows a minimum spanning tree by adding a leaf to the current tree in the 
cheapest way 

Principal submatrix: square submatrix using rows and columns with the same indices 

Product dimension [398]: minimum number of coordinate in a product representation of G 

Product representation [398]: encoding of graph such that vertices are adjacent if and only if their 
codes differ in every coordinate 

Proper coloring [192]: (1) for vertices, a coloring in which no edge is monochromatic; (2) for edges, 
a coloring in which edges sharing an endpoint have distinct colors 

Proper subgraph of G [192]: a subgraph not equal to G 

Proper subset of 5 [472]: a subset not equal to S 

Proposal Algorithm [131]: procedure for creating a stable matching 

Priifer code [81]: for a labeled tree, a sequence of length n — 2 obtained by successively deleting the 
leaf with smallest label and recording its neighbor’s label 

Pseudograph: graph model that allows loops and multiple edges, used by authors who define 
multigraphs not to have loops 
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Radius [70]: the minimum of the vertex eccentricities 

Ramsey number [380]: the minimum number of-vertices such that assigning colors to all pairs of 
those vertices produces a monochromatic clique of specified size (or a specified graph) in one 
of the colors 

Random graph [430]: a graph from a probubiliGgeepace most often the space in which each labeled 
pair of vertices independently has probability p of adjacency; typically, p = 1/2 or pis a 
function of n 

Random variable [427]: a variable that takes on a value at each point in a probability space 

Rank (matroids) [349]: for a set of elements, the largest size of an independent set it contains 

Reconstructible [38]: a graph determined (up to isomorphism) by the list of subgraphs obtainable 
by deleting a single vertex 

Reconstruction Conjecture [38]: claim that all graphs with at least 3 vertices are reconstructible 

Rectilinear crossing number: the minimum number of crossings in a drawing of the graph in the 
plane in which all edges appear as straight line segments 

Reducible configuration [258]: forbidden from purported minimal 5-chromatic planar graph 

Reflexive [490]: (1) a digraph with a loop at every vertex; (2) a binary relation R with x Rx for all x 

Region [235]: for an embedding of a graph on a surface, a maximal connected subset of the surface 
that does not contain any part of the graph 

Regular [34]: having all vertex degrees equal 

Regular matroid [351]: representable over every field 

k-regular [34]: having all vertex degrees equal to k 

Representable matroid [351]: linear matroid 

Restriction martingale [445]: martingale in which the value of successive variables is an expecta- 
tion over a shrinking subset of the probability space 

Rigid circuit graph: chordal graph 

Robbins’ Theorem [166]: every 2-edge-connected graph has a strong orientation 

Root [100]: (1) a distinguished vertex; (2) in a branching, the vertex with indegree 0 

Rooted plane tree [100]: a tree with a distinguished root vertex so that children of each non-leaf 
have a specified left-to-right ordering in the plane 

Rotation scheme: a description of a 2-cell embedding; a circular permutation of the edges appearing 
at each vertex, giving their counter-clockwise order around the vertex 
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SATISFIABILITY [499]: the problem of finding truth values for variables to make a logical input 
formula true 

Satisfiable [499]: formula having a “yes” answer in the SATISFIABILITY problem 

Saturated vertex [107]: for a matching, a matched vertex 

Score sequence [62]: the sequence of outdegrees in a tournament 

Second moment method [433]: method for obtaining threshold functions 

Self-complementary [11]: isomorphic to the complement 

Self-converse: isomorphic to the converse 

Self-dual: isomorphic to the dual 

Semi-strong perfect graph theorem [344]: if V(G) = V(H) and a set of vertices induces P, in G if 
and only if it induces P, in H, then G is perfect if and only if H is perfect 

Semipath: an semiwalk in which each vertex appears at most once 

Semiwalk: a sequence of edges (or adjacent vertices) in a directed graph such that each successive 
pair of edges are adjacent, without regard to the orientation of the edges 

Separable: having a cut-vertex 

Separating set: a vertex set whose deletion increases the number of components 

k-set [380]: set of size k 

Shannon Switching Game [365]: a game played on a matroid by the Spanner and the Cutter, one 
trying to seize a set of elements spanning a specified element, the other trying to prevent this 

Shift graph [202]: graph on the 2-subsets of [n] having {i, j} adjacent to {j,k} when i < j <k 

Signed (di)graph: special case of weighted (di)graph, assigning + or — to each edge 

Simple [2]: (1) a graph with no loops or multiple edges; (2) a digraph having at most one edge with 
each ordered pair of endpoints; (3) a matroid having no loops or parallel elements 

Simplicial vertex [224]: (1) a vertex whose neighbors induce a clique; 

Sink [176]: a distinguished terminal vertex, or any vertex with outdegree 0 

Size [35, 473]: (1) the number of edges; (2) the number of elements _ 

Skew partition [347]: a partition X, Y of V(G) such that G[X] and G[Y] are disconnected 

f-soluble [148]: having an edge weighting so that the sum of the weights incident to v is f(v) 

Source [176]: a distinguished initial vertex, or any vertex with indegree 0 

Source/sink cut [178]: a partition of the vertices of a network into sets S,T such that S contains 
the source and T contains the sink Bp 

Span function [358]: the span of a set X ina hereditary system consists of X and the elements not 
in X that complete circuits with subsets of X 

Spanning subgraph: a subgraph containing each vertex 

Spanning set [67]: a set whose span (in a hereditary system on E) is E 

Spanning tree [67}: a spanning, connected, acyclic subgraph 

Spectrum [453]: the list of eigenvalues with multiplicities 

Split graph [345]: a graph whose vertices can be covered by a clique and an independent set 

Splittance: minimum number of edges to be added or deleted to obtain a split graph 

Square of a graph: the second power 

Squashed-cube dimension [401]: minimum length of the vectors in a squashed cube embedding 

Squashed-cube embedding [401]: encodes vertices by 0, 1, +-vectors such that distance between two 
vertices is the number of coordinates where one has 0 and the other has 1 

Stability number [319]: independence number 

Stable matching [130]: a matching having no instance of x and y each prefering the other to their 
current partner inthe matching 

r-staset [447]: stable set of size r 

Stable set [3, 319]: a set of pairwise nonadjacent vertices (same as independent set) 

Star [67]: the tree K,,,-1 with at most one non-leaf 

Star-cutset [333]: separating set inducing a subgraph having a vertex adjacent to all others 

Star-cutset Lemma [334]: no p-critical graph has a star-cutset 

Steinitz exchange property [358]: the property of span functions tl.at if e is in the span of X U f 
but not in the span of X, then f is in the span of X Ue 

Steinitz’s Theorem: 3-connected planar graphs have only one embedding in the plane (more pre- 
cisely, only one dual graph) 

Strength [440]: of a theorem, the fraction of the time when the conclusion holds that the hypothesis 
also holds 
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Strict digraph [294]: a digraph having no loops and at most one edge with each ordered pair of 
endpoints 

Strictly balanced: average vertex degree in subgraphs is maximized only by the full graph 

Strong absorption property (matroids) [355]: if r(X Ue) = r(X) for alle € Y, thenr(X UY) = r(X) 

Strong component [56]: maximal strongly connected subdigraph 

Strong orientation [165]: orientation of G in which each vertex is reachable from every other 

Strong Perfect Graph Conjecture (SPGC) [320]: the conjecture that a graph is perfect if and only if 
it has no odd hole or odd antihole : 

Strong product G, - Ge: a graph product with vertex set V(G1) x V(G2) and edge set (ui, v1) © 
(ug, vg) if uy = ug or uy @ Ug and v}.= ve or v1 © ve 

Strongly connected (or strong) digraph [56]: a digraph with each vertex reachable from all others 

Strongly perfect [330]: a graph in which some stable set meets every maximal clique 

Strongly regular [464]: a k-regular graph whose adjacent pairs have 4 common neighbors, and 
whose nonadjacent pair have 4, common neighbors 

Subconstituent [470]: the subgraph induced by a vertex neighborhood or by a vertex non- 
neighborhood 

Subdigraph [56]: a subgraph of a directed graph 

Subdivision [212]: (1) the operation of replacing an edge by a path of two edges through a new 
vertex; (2) a graph obtained by a sequences of subdivisions. 

H-subdivision [212]: a graph obtained from H by subdivisions 

Subgraph [5]: a graph whose vertices and edges all belong to G 

Submodular function [354]: a function such that r(X UY) +r(X MY) < r(X)+r(Y) for all sets X, Y 

Submodularity property (matroids) [354]: having a submodular rank function 

k-subset [471]: subset with k elements 

Subtree representation [324]: assigns subtrees of a host tree to each vertex of a chordal graph so 
that vertices are adjacent if and only if the-corresponding subtrees intersect 

Successor [54]: for u in a digraph, a vertex v with u>ov 

Successor set [58]: for u in a digraph, the set of successors 

Sum [39]: (1) for cycles and cocycles, same as symmetric difference; (2) for a graph, the disjoint 
union; (3) for matroids on disjoint sets, the matroid on their union whose independent sets 
are all unions of an independent set from each 

Supergraph of G: a graph containing G oO 

Superregular [470]: a regular graph that is null or whose subconstituents are all superregular 

Supply [184]: source constraint in a transportation network 

2-switch [46]: a degree-preserving switch of two disjoint edges for two others not present 

Symmetric [490]: (1) for a graph, having a non-trivial automorphism; (2) for a simple digraph, 
u—>v<#v-— u; (3) for a binary relation R, xRy © yRx 

Symmetric difference AaB [109, 473]: the set of elements in exactly one of A and B 

System of distinct representatives (SDR) [119]: from a collection of sets, a choice of one member 
from each set so that all the representatives are distinct 

Szekeres-Wilf Theorem [231]: x(G) < 1+ maxycg 5(H) 


Tail [53]: the first vertex of an edge in a digraph 

Tait coloring [301]: for a planar cubic graph, a proper 3-edge-coloring 

Tarry’s Algorithm [95]: procedure for exploring a maze 

Telegraph problem [423]: directed version of gossip problem with one-way transmissions 

Telephone problem [422]: gossip problem 

Tensor product: weak product 

Ternary matroid [357]: representable over the field with three elements 

Thickness [261]: the minimum number of planar graphs whose union is G 

Threshold dimension: minimum number of threshold graphs whose union is G 

Threshold function for Q [433]: a function t such that Q almost always or almost never occurs, 
depending on whether the parameter in the model belongs to o(t) or to w(t). 

Threshold graph: having a threshold t and a vertex weighting w such that u # v iff w(u)+w(v) <1; 
many other characterizations, including absence of a 2-switch and existence of a construction 
ordering by adding isolated or dominating vertices 

Topological graph theory: the study of drawings of graphs on surfaces 
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Toroidal [266]: graph having a 2-cell embedding on the torus 

Torus [266]: the (orientable) surface with one handle 

Total coloring [411]: a labeling of both the vertices and edges so that elements that are adjacent or 
incident receive different colors 

Total Coloring Conjecture [411]: every graph G has a total coloring using at most A(G) + 2 colors 

Total domination number [117]: minimum number of vertices in a set § such that every vertex has 
a neighbor in § 

Total interval number: minimum of the total number of intervals used to represent G as the 
intersection graph of unions of intervals on the real line 

Totally unimodular [469]: a matrix in which all square submatrices have determinant 0 or +1 

Toughness [288]: the minimum ¢ such that |S| > t -c(G — S) for every separating set 5, where 
c(G — S) is the number of components of the subgraph obtained by deleting S 

Tournament [61]: an orientation of the complete graph 

Trace [453]: sum of the diagonal elements of a matrix 

Traceable: having a Hamiltonian path 

Trail [20, 59]: a walk in which no edge appears more than once 

Transitive digraph [228]: u > v and v > w together imply u > w 

Transitive closure: (1) for a digraph D, the digraph with u — w whenever there is a path from u to 
w in D; (2) for a relation R, the relation § with xSy whenever there is a sequence xo, ..., xx 
with x = x9Rx,R--- Rx, = y 

Transitivity of dependence (matroids) [359]: e € o(X) and X C a(Y) imply e € a (Y) 

Transportation constraints [184]: supplies and demands 

Transportation Problem [185]: generalization of the assignment problem with supplies at'each 
source and demands at each destination 

Transversal [125]: a system of distinct representatives (this is the word used when the concept is 
generalized); also used for a system of representatives not necessarily distinct 

Transversal matroid [352]: a matroid whose elements are one partite set of a bipartite graph and 
whose independent sets are the subsets saturated by matchings 

Traveling Salesman Problem (TSP) [493]: problem of finding a minimum-weight spanning cycle 

Tree [67]: a connected graph with no cycles E 

k-ary tree [101]: rooted tree with at most k children at each non-leaf vertex 

k-tree [345]: a chordal graph obtained from a k-clique by iteratively adding a vertex whose neigh- 

borhood when added is a k-clique 

Triangle [12]: a cycle of length 3 

Triangle-free [41]: not having K3 as a subgraph 

Triangle inequality: d(x, y) + d(y, z) > d(x,z 

Triangular chord: chord of length two along a path or cycle 

Triangulated graph [225]: a graph with no chordless cycle 

Triangulation [242]: a graph embedding on a surface such that every region is a 3-gon 

Trivalent: having degree 3 

Trivial graph [22]: graph with no edges (some authors restrict to one vertex) 

k-tuple [474]: a list of length k 

Turan graph [207]: an equipartite complete multipartite graph 

Turan’s theorem [208]: charcterization of the complete equipartite r-partite graphs as the largest 
graph of a given order with no r + 1-clique 

Tutte polynomial: a generalization of the chromatic polynomial and of other polynomials 

Tutte’s Theorem [146, 174, 250]: (1) for matchings, characterization of graphs with 1-factors; (2) for 
connectivity, characterization of 3-connected graphs by contractions to wheels; (3) for planar 
graphs, 3-connected planar graphs have embeddings with all bounded faces convex. 

Twins [348]: vertices having the same neighborhood (false twins are adjacent vertices with the 
same closed neighborhoods) 


Unavoidable set [258]: a collection of configurations such that every graph in a specified class 
contains some configuration in the collection 

Underlying graph [56]: the graph obtained from a digraph by treated edges as unordered pairs 

Unicyclic: having exactly one cycle 

k-uniform hypergraph [449]: having only edges of size k 
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Uniform matroid U;,, [357]: matroid on [n] whose independent sets are the sets of size at most k 

Uniformity property (matroids) [354]: for all X C E, the maximal independent subsets of X have 
the same size 

Union (G1 U Gg) [25]: a graph whose vertex set is the union of the vertices in G, and Gz and whose 
edge set is the union of the edges in G; and G2 (written G1 + G2 if the vertex sets are disjoint) 

Union of matroids [369]: the union of matroids M,,...,M, is the hereditary system whose inde- 
pendent sets are {/; U---UX: J; €1;} 

Unit-distance graph [201]: the graph with vertex set R? in which points are adjacent if the distance 
between them is 1 

Unlabeled graph [9]: informal term for isomorphism class 

M-unsaturated [107]: vertex not belonging to an edge of M 

Upper embeddable: having a 2-cell embedding on a surface of genus [(e(G) — n(G) + 1)/2] 


Valence: vertex degree 

Value of a flow [176]: the net flow out of the source or into the sink 

Variance [433]: expected squared deviation from the mean 

Vectorial matroid [351]; linear matroid 

Vertex [2]: element of V(G), the vertex set 

Vertex chromatic number [191]: chromatic number 

Vertex connectivity [149]: connectivity 

Vertex cover [112]: a set of vertices containing at least one endpoint of every edge 

Vertex-critical: deletion of any vertex changes the parameter 

Vertex cut [149, 164]: a separating set of vertices 

Vertex-deleted subgraph [37]: a subgraph obtained by deleting one vertex 

Vertex multiplication [320]: a replacement of vertices of G by independent sets such that copies of 
x and y are adjacent if and only if xy € E (G) 

Vertex partition: a partition of the vertex set = 

Vertex set V(G) [2]: the set of elements on which the graph is defined 

Vertex-transitive [14]: for each pair x, y € V(G), 80ome automorphism of G maps x to y 

Vizing’s Theorem [275]: upper bound on edge-chromatic number in terms of maximum degree and 
maximum edge multiplicity Eh 

Walk [20, 59]: an alternating list of vertices and edges in a graph such that each vertex belongs to 
the edge before and after it (in a digraph, must follow arrows) 

u, v-walk [20]: a walk from u to v. 

Weak elimination property [352]: property of matrices that the union of distinct intersecting cir- 
cuits contains a circuit that avoids a specified point in the intersection 

Weak product G; @ G2: a graph product with vertices V(G1) x V(G2), and edges (uy, v1) < (ug, v2) 
iff uy <> ug and v1 © v2 = 

Weakly chordal [330]: having no chordless cycle of length at least 5 in G or G 

Weakly connected [56]: a directed graph whose underlying graph is connected 

Weight: a real number 

Weighted: having an assignment of weights (to edges and/or vertices) 

Well Ordering Property [19]: every nonempty set (of natural numbers) has a least element 

Wheel [174]: a graph obtained by taking the join of a cycle and a single vertex 

Whitney’s 2-isomorphism Theorem [376]: a characterization of the pairs of graphs whose cycle 
matroids are isomorphic 

Wiener index [72]: the sum of the pairwise distances between vertices 


Zero flow: a flow in a network with flow 0 on every edge 
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Many books have been published about graph theory. Here we list a few for 
the interested reader who seeks an alternative presentation or more detailed 
material on special topics. We list several general textbooks grouped approx- 
imately into three levels. Specialized texts and monographs follow, listed by 
the relevant chapter in this book. Finally, we list some books that present 
additional topics in graph theory. 


General /elementary: 

Chartrand, G. Graphs as Mathematical Models. Prindle~Weber—Schmidt, 1977. Reprinted 
as Introductory Graph Theory, Dover, 1985. 

Clark J. and D.A. Holton, A first look at graph theory. World Scientific, 1991. 

Trudeau R.J., Introduction to graph theory (originally Dots and Lines, 1976). Dover, 
1993. 

Wilson R.J. Introduction to graph theory. Academic Press, 1979, 1972; Longman, 1985. 

Wilson R.J. and J.J. Watkins, Graphs: An rurOGUetOTy approach. John Wiley & Sons, 
1990. 


h_books 


General / intermediate: 

Bondy J.A. and U.S.R. Murty, Graph Theory with Applications. Elsevier, 1976. 

Chartrand G. and L. Lesniak, Graphs and Digraphs. PWS Publishers, 1979; Wadsworth— 
Brooks/Cole, 1986; Chapman & Hall, 1996. 

Gould R., Graph Theory. Benjamin/Cummings, 1988. 

Gross J. and J. Yellen, Graph Theory. CRC Press, 1999. 

Harary F., Graph Theory. Addison-Wesley, 1969. 

Ore O., Theory of Graphs. AMS Colloq. 38, Amer. Math. Soc., 1962. 


General / advanced: 

Berge, C. Graphs. North-Holland 1973, 1976, 1985. (1970, 1983 in French.) 

Bollobas B., Graph Theory: An Introductory Course. Grad. Texts in Math. 63; Springer- 
Verlag, 1979. 

Bollobds B., Modern Graph Theory. Grad. Texts Math. 184; Springer, 1998. 

Diestel R., Graph Theory Grad. Texts Math. 173; Springer-Verlag, 1996, 2000. 

Zykov A.A. Fundamentals of graph theory Nauka, 1987 (Russian). Transl. by L. Boron, 
C. Christenson, and B. Smith, BCS Associates, 1990. 
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Chapter 1: 

Asratian A.S., T.M.J. Denley, and R. Higgkvist, Bipartite graphs and their applications. 
Cambridge Tracts in Math., 131; Cambridge Univ. Press, 1998. 

Fleischner H., Eulerian Graphs and Related Topics, Vols 1 & 2. Ann. Discrete Math. 45 
& 50, North-Holland, 1990 & 1991. 

Harary F., R.Z. Norman, and D. Cartwright, Structural Models: An Introduction to the 
Theory of Directed Graphs. John Wiley & Sons, 1965. 


Chapter 2: . 
Buckley F. and F. Harary, Distance in Graphs. Addison-Wesley, 1990 
Moon J., Counting Labelled Trees. Canadian Math. Congress, 1970. 


Chapter 3: 

Gusfield D. and R.W. Irving, The Stable Marriage Problem: Structure and Algorithms. 
MIT Press, 1989. 

Haynes T.W., S.T. Hedetniemi, and P.J. Slater, Fundamentals of Domination in Graphs. 
Pure and Applied Math. 208; Marcel Dekker, 1998. 

Lovasz L. and M.D. Plummer, Matching Theory. North-Holland, 1986. 


Chapter 4: 

Ahuja R.K., T.L. Magnanti, and J. Orlin, Network Flows. Prentice- Hall, 1993. 
Ford L.R. and D.R. Fulkerson, Flows in Networks. Princeton Univ. Press, 1962. 
Tutte W.T., Connectivity in Graphs. Univ. Toronto Press, 1966. 


Chapter 5: 
Jensen T.R. and B. Toft, Graph coloring problems. Wiley-Interscience, 1995. 
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Chapter 6: 

Aigner M., Graph Theory: A Development from the 4-Color Problem. Teubner, 1984 (Ger- 
man). Transl. by BCS Associates, 1987. 

Bonnington C.P. and C.H.C. Little, The Foundations of Topological Graph Theory. 
Springer-Verlag, 1995. 

Fritsch R. and G. Fritsch, The Four-Color Theorem. Springer, 1994, 1998. 

Gross, J.L. & T.W. Tucker, Topological Graph Theory. Wiley-Interscience, 1987. 

Nishizeki T. and N. Chiba, Planar Graphs: Theory and Algorithms. North-Holland Math. 
Studies 140, Annals Disc. Math. 32; North-Holland 1988. 

Saaty T.L. and P.C. Kainen, The Four-Color Problem: Assaults and Conquests. McGraw- 
Hill, 1977; reprinted Dover, 1986. 

White A.T., Graphs, Groups and Surfaces. North-Holland Math. Studies 8; North- 
Holland 1973, 1984. 


Chapter 7: 

Fiorini S. and R.J. Wilson, Edge-colourings of Graphs. Res. Notes in Math. 16; Pitman, 
1977. 

Voss H.-J., Cycles and Bridges in Graphs. Kluwer Academic, 1991. 

Zhang C.-Q., Integer Flows and Cycle Covers of Graphs. Pure and Applied Math. 205; 
Marcel Dekker, 1997. 


Section 8.1: 

Golumbic M.C., Algorithmic Graph Theory & Perfect Graphs. Acad. Press, 1980. 

Brandstadt A., V.B, Le, and J.P. Spinrad, Graph Classes: A Survey. Soc. Ind. Appl. Math., 
1999. 
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Section 8.2: 
Oxley J., Matroid Theory. Clarendon Press, Oxford Univ. Press 1992. 
Welsh D.J., Matroid Theory. Academic Press, 1976. 


Section 8.3: 
Graham R.L., B.L. Rothschild, and J.H. Spencer, Ramsey Theory. Wiley-Interscience, 
John Wiley & Sons, 1980, 1990. 


Section 8.4: 
Bollobas B., Extremal graph theory. London Math. Soc. Monographs 11; Academic Press, 
1978. (Also treats material of Chapter 5.) 


Section 8.5: 

Alon N. and J. Spencer, The Probabilistic Method. 

Bollobas B., Random graphs. Academic Press, 1985. 

Janson S., T. Luczak, and A. Ruciriski, Random Graphs. Wiley-Interscience, John Wiley 
& Sons, 2000. 

Palmer E.M., Graphical Evolution. John Wiley & Sons, 1985. 


Section 8.6: 

Biggs N., Algebraic graph theory. Cambridge Tracts in Math. 67, Cambridge Univ. Press, 
1974, 1993. Z 

Chung F.R.K. Spectral graph theory. CBMS Reg. Conf. Series in Math. 92; Amer. Math. 
Soc. 1997. 2 

Cvetkovié D.M., M. Doob, and H. Sachs, Spectra of graphs: Theory and Applications. 
Pure and Appl. Math. 87, Academic Press, 1980; 1985; Johann Ambrosius Barth, 
1995. 
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Algorithms and Applications: 

Chartrand G. and O.R. Oellermann, Applied and Algorithmic Graph Theory. McGraw- 
Hill, 1993. 

Chen W.K. Applied Graph Theory: Graphs and Electrical Networks. Series in Appl. Math. 
& Mechanics 13, North-Holland, 1976 (2nd ed.). 

Christofides N., Graph Theory: An Algorithmic Approach. ‘Acad. Press, 1975. 

Even S., Graph algorithms. Computer Science Press, 1979. 

Foulds L.R., Graph Theory Applications. Universitext; Springer-Verlag, 1992. 

Gibbons A., Algorithmic Graph Theory. Cambridge Univ. Press, 1985. 

Gondran M. and M. Minoux, Graphs and algorithms, (translated by Steven Vajda). 
Wiley-Interscience, John Wiley & Sons, 1984. 

Lawler E., J.K. Lenstra, A.H.G. Rinooy-Kan, and D.B. Shmoys, The Traveling Salesman 
Problem. Wiley-Interscience, John Wiley & Sons, 1985, 1990. 

McHugh J.A., Algorithmic Graph Theory. Prentice-Hall, 1990. 

Swamy M.N.S. and K. Thulasiraman, Graphs, Networks, and Algorithms. Wiley- 
Interscience, John Wiley & Sons, 1981. 

Temperley H.N.V., Graph Theory and Applications. Halstead Press, 1981. 

Wilson R.J. and L.W. Beineke (eds.), Applications of Graph Theory. Academic Press, 
1979. 


Additional Topics: 
Beineke L.W. and R.J. Wilson (eds.), Selected topics in graph theory, Vols. 1 & 2 & 3. 
Academic Press, 1978 & 1983 & 1988. 
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Cameron P.J and J.H. van Lint, Designs, Graphs, Codes and Their Links. Lond. Math. 
Soc. Student Texts 22, Cambridge Univ. Press, 1991. 

Capobianco M. and J.C. Molluzzo, Examples and counterexamples in graph theory. 
North-Holland, 1978. 

Berge C., Hypergraphs. N.-H. Math. Lib. 45, North-Holland, 1987, 1989. 

Biggs, N.L., K.E. Lloyd, and R.J. Wilson, Graph Theory: 1736-1936. Clarendon Press, 
Oxford Univ. Press, 1976, 1986. 

Bosak J., Decompositions of graphs. Math. & Its Appl. (East European Series) 47, Kluwer 
Academic Publishers, 1990. 

Brouwer A.E., A.M. Cohen, and A. Neumaier, Distance-regular graphs. Springer-Verlag, 
York, 1989. 

Chung-F.R.K. and R.L. Graham, Erdés on Graphs: His Legacy of Unsolved Problems. 
A.K. Peters, 1998. 

Fulkerson D.R. (ed.), Studies in graph theory, Parts I & II. Studies in Math. 11 & 12. 
Math. Assoc. Amer., 1975. 

Harary F. and E.M. Palmer, Graphical Enumeration. 

Hartsfield N. and G. Ringel, Pearls in graph theory. Academic Press, 1990, 1994. 

Holton D.A. and J. Sheehan, The Petersen graph. Australian Math. Soc. Lect. Series 7, 
Cambridge University Press, 1993. 

Imrich W. and S. Klavzar, Product Graphs: Structure and Recognition. Wiley-Interscience, 
John Wiley & Sons, 2000. 

Lovasz L., R.L. Graham, and M. Grétschel (eds.), Handbook of Combinatorics, Vol. I. 
Elsevier, 1995. 

Mahadev N.V.R. and U.N. Peled, Threshold ¢ 
56; North-Holland, 1995. E 

McKee T.A. and F.R. MeMorris, Topics in intersection graph theory. Soc. Ind. Appl. Math., 
1999. g 

Moon J.W., Topics on Tournaments. Holt, Rinehart, and Winston, 1968. 

Prisner E., Graph Dynamics. Pitman, 1996: 

Scheinerman E.R. and D.H. Ullman, Fractional Graph Theory: A Rational Approach to 
the Theory of Graphs. Wiley-Interscience, John Wiley, 1997. 

Tutte W.T. Graph theory. Encyc Math. & Appl. 21, Addison-Wesley, 1984. 

Yap H.P. Some topics in graph theory. London Math. Soc. Lect. Notes 108, Cambridge 
Univ. Press, 1986. 
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BFS 99, 101, 105, 156 

Bridg-it 73-4, 80, 365 

bridge 1~2, 51, 73-5, 80, 
105, 252, 326 

H-bridge 252 

bridgeless graph 304, 308-9, 
311-3, 317-8 

broadcasting 

Brooks’ Theorem 198-200, 
203, 216, 230, 284 

Brouwer Fixed-Point Theo- 
rem 389 


cactus 160 

cage 49, 79 

canonical labeling 438-9 

capacity 176, 178 

cartesian product (graphs) 
193-4, 199, 265, 284, 296, 
344, 400, 410, 422, 460 

cartesian product (sets) 193, 
474 a 

caterpillar 88=9, 94, 346, 
396, 423 = 

Cayley’s Formula 81-3, 85, 
92-3, 345, 462 

Cayley-Hamilton Theorem 
456-7 

ceiling 39, 483 

cell (in simplicial subdivi- 
sion) 388-91, 395 

2-cell embedding 268, 272 

2-cell 268, 272 

center 72, 78, 81, 105, 393, 
460 

centroid 393 

chain (under a partial order) 
374, 418, 445 

Chairperson Identity 488 

characteristic polynomial - 
453-7, 462, 468 

characterization 23—4, 27-9, 
34, 44-5, 60, 64, 68, 75, 
118, 138, 141, 154, 162-— 
3, 174, 187, 192, 217, 
225, 239, 246, 251-2, 
269, 274, 280, 282, 286, 
310, 323-4, 328, 330-1, 
335, 337, 340, 345, 354, 
358-60, 362, 368, 373, 
378, 461, 472, 495 


Subject Index 


charge 260-1 

Chebyshev’s Inequality 433, 
451 

children (in rooted tree) 
100-2, 106 

Chinese Postman Problem 
99, 105-6, 130, 318 

choice function 408-9 

choice number 408-9, 412 

choosability 408-9 

f-choosable 410 

k-choosable 408-10, 423 

chord 225-6, 234, 240, 245, 
2538, 263, 271, 310, 330, 
341, 343, 347, 412, 437 

chordal graph 224-7, 230-1, 
323-31, 334, 345-7, 423 

chordless cycle 225-7, 323, 
326, 329, 344 

chromatic index 275 

chromatic number 5, 191- 
219, 230, 238, 257, 275, 
283, 309, 319-20, 408-12, 
429, 441-2, 446-7, 449, 
459, 476 

CHROMATIC NUMBER 
501-2 

k-chromatic 192, 196, 200-7, 
210, 213-9 

chromatic polynomial 220, 
221—4, 229-31 

Chvatal’s condition 290-2, 
297-8, 418 

Chvatal’s Conjecture 288 

Chvatal—Erdés Theorem 
292, 297-8, 441 

circle graph 341, 344, 348 

circuit (in graph) 27, 28-34, 
42, 140, 233, 262, 273, 
285, 298-9, 308,.313 

circuit (in digraph) 60-1, 64, 
77, 89-91, 99, 498, 506 

circuit (in matroid) 349-62, 
365, 373-5 

circular-are graph 341, 348 

circulation 187, 190, 308 

CIRCUMFERENCE 416, 
495 

circumference 263, 293, 313, 
416-7 

Class 1 278-9, 284 

Class 2 278 

clause 499-501, 506 


Subject Index 


claw (Ki,3) 12, 15, 18, 37, 
87, 199, 279, 285-6, 333, 
341-3, 348 

claw-free 49, 117-8, 147, 
173, 217, 281-3, 285, 
297, 341-2 

Clebsch graph 466 

CLIQUE 502 


clique 4, 9, 62, 123, 137, 153,. 


173, 192-217, 224-31, 
263, 275, 280-3, 286, 288, 
291, 319-48, 381, 384-7, 
394-400, 413-4, 420, 422, 
426, 439-41, 447-8, 453, 
465, 470, 496, 502 

clique cover(ing) 226, 319-21, 
326, 339, 342, 344, 422 

clique cover(ing) number 
226, 319 

clique decomposition 397 

clique identification 344 

clique number 192, 
199, 231, 319, 335, 339, 
439-41, 447 

clique tree 327-8, 345 

clique-vertex incidence ma- 
trix 328-9, 346 

closed ear 164—5, 172 

closed neighborhood 116, 
341, 468 

closed set (in the plane) 
233-41, 245, 254, 267-8, 
389-90, 397, 452, 468 

closed set (matroids) 360, 
362, 367-8, 371-2, 375 

closed trail 20, 26—7, 30-1, 
34, 57-60, 172—3, 290-1, 
295, 313, 506 

closed walk 20, 24, 32, 48, 
63, 65, 99, 237, 239, 455 

closure (Hamiltonian) 289-— 
90, 298, 419, 449 

closure function (matroids) 
360 

closure operator 360 

co-critical vertex pair 339 

cograph 202, 344 

cobase 360-2 

cocircuit 360, 362, 375 

cocycle (matroids) 362 

cograph 202 

color 4, 191-2, 204, 275, 380 

color class 191-3, 200, 203-4, 
207, 217, 275, 331, 339 


color sum 204 

color-critical 192, 199, 206, 
210, 215, 218, 344 

2-COLORABILITY 495, 505 

3-COLORABILITY 500, 
504-5 

k-COLORABILITY 495, 501, 
505 

k-colorable 191-2, 202, 204, 
211, 309, 363, 408 

k-coloring 191—4, 198, 200, 
205, 207, 210-1, 216-7, 
219-24, 229, 309, 380-3, 
386, 393-4, 449 

column matroid 351-2, 375 

combinatorial design 11, 
465, 470 

common system of distinct 
representatives (CSDR) 
119, 171-2, 353, 368~—9 

comparability graph 228, 
231, 329-31 

compatible pair 232 

complement (graph) 4, 10- 
2, 15, 38, 49, 52, 71, 77, 
80, 115, 121, 201, 207, 
215-6, 226-7, 245, 255, 
283, 297, 312, 320, 322, 
334-5, 340-1, 344, 360-2, 
366, 375, 379, 393, 400, 
422, 456, 461, 465-70 

complement (set) 474 

complement reducible 344 

complete bipartite graph 
9-10, 14, 33, 41, 104, 
409, 413, 416 

complete graph 9-11, 16, 
26, 32, 50, 62, 79, 83-4, 
87, 104, 108, 149, 193- 
4, 197-204, 207, 214, 
217-8, 221, 224, 230-1, 
263, 290, 293, 298, 329, 
336, 344, 381, 386, 398- 
9, 419, 426, 459, 470, 487 

complete loopless digraph 
393 

complete multipartite graph 
207, 215 

complete subgraph 26, 280-1, 
381, 386, 397 (see clique) 

completely labeled cell 388— 
9, 391, 395 

complexity 125, 269, 286, 
425, 494, 496, 499 
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component 22-32 

composition (of functions) 9, 
18, 485-7 

composition (of graphs) 284, 
332—4, 393 

conclusion 477 

conditional statement 248, 
477-9, 481 

conditional probability 443, 
448 

configuration (reducible) 
258-61, 265, 270 

conflict graph 252, 254, 256 

congruence (modulo n) 52, 
64, 88, 94, 194, 204, 217, 
269, 272, 274, 303, 309, 
464, 490-1 

conjunction 477 

connected dominating set 
117, 122-3 

connected graph 5, 21 

2-connected graph 150, 155, 
158, 161-4, 173-5, 198, 
204, 213, 240, 243-4, 
247-8, 250, 252-4, 287-8, 
293, 295-6, 298, 312-4, 
317, 348, 417-9 

3-connected graph 150, 158-9, 
166, 174-5, 213, 218, 237, 
247-52, 256, 292, 295, 
301-4, 316, 376, 505 

k-connected 149, 151, 158-62, 
164, 169-70, 174-5, 283, 
298, 440, 450-1 

connected to 21-2, 31 

connection relation 21-2, 29, 
34, 59, 63 

CONNECTIVITY 149, 152, 
164, 439, 495 

connectivity 149-53, 158-9, 
163-9, 174, 182, 211, 215, 
248, 274, 292, 301-2, 304, 
313-4, 406, 439-41, 463 

connector 391-2 

consecutive 1s property 
328-9, 346-7 

conservation constraints 
176-7, 184, 186-8, 307 

consistent rounding 186, 190 

construction procedure 30, 
324 

contains 6, 21, 24, 471 

contraction (edge) 84-5, 
143-5, 2138, 218, 221-3, 
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239, 241, 249-51, 256, 
269, 305, 317, 324, 

contraction (in matroids) 
363-6, 375-7 

contrapositive 38, 77, 110-1, 
159, 200, 249, 290, 324, 
478, 491 

converse (of conditional) 477 

convex combination 395 

convex embedding 248-50, 
255 

convex function 443 

convex polygon 247-8, 256 

copy 10 

cost 95-7, 100, 103, 126-30, 
185, 494, 496-500, 505 

counting arguments 34-7, 
47-50, 68, 79-85, 92, 108, 
111, 138, 219, 223-4, 
229, 241, 263, 272, 279, 
322, 335, 385, 420, 427, 
436, 458, 463, 473, 485-9 

Coupon Collector 451 

cover/covering (see edge 
cover, vertex cover, etc.) 

COVERING CIRCUIT 506 

covering set 127-8 

critical 94, 122, 147, 192, 
196, 198-9, 201, 203, 
206, 210-3, 215, 217-8, 
334-6, 339-44, 348, 506 

a-critical 122, 506 

k-critical 192, 196, 198-9, 
203, 210-3, 215, 217-8 

critical edge 122, 340, 342-3 

critically connected 

cross edges (in Petersen 
graph) 276-7 

crossing 234 

crossing number 262-4, 269 

cryptomorphism 360 

CSDR (see Common Syst. of 
Distinct Representatives) 

cube Q3 3, 35-6, 49, 51, 76, 
105,-119, 150, 236, 243, 
255, 271, 295-6, 379, 390, 
397, 401-3, 422, 468 

cubic graph 304-11 

curve 1-2, 48, 54, 233-9, 
241, 245-6, 254, 268 

cut (see edge cut, source/sink 
cut, vertex cut) 

x, y-cut 166-8, 172 


cut-edge 23, 43—4, 48-9, 52, 
68-70, 75, 77, 104, 139, 
147, 155, 158, 165, 173~— 
5, 237-8, 300-1, 304, 
307-8, 313 

cut-vertex 23, 29, 31-2, 77, 
146, 155-6, 158, 160, 
162, 198, 212, 240, 243, 
247, 284, 420, 506 

cycle 5-6, 9-20, 23-37, 43, 
49, 55-60, 63-71, 75-9, 
84—7, 96, 103-5, 108-10, 
118~—9, 122, 140, 147, 155, 
159-65, 170-5, 192-200, 


203-4, 218, 216-7, 224-35, 


238-45, 250-9, 270-7, 
284-306, 310-8, 323, 
326-30, 341-4, 349-65, 
373--76, 379, 391, 394-5, 
408-24, 429, 436-7, 440— 
1, 452-5, 460, 467-8,, 
492-4, 497-9, 502, 505 

n-cycle 9, 12, 35, 49, 92, 94, 
306, 417-8, 460, 468 

4-cycle 14, 23,25, 34, 48— 
9, 70, 94, 193, 221, 223, 
228, 270, 305, 329, 345, 
394-5, 408; 460, 467, 505 

5-cycle 11-4, 18, 50, 92, 108, 
114, 119, 142, 192-3, 199, 
205-6, 210, 215, 234, 
252, 270, 276-7, 312, 318, 
323, 336, 344-5, 348, 
384, 394, 422, 460, 470 

6-cycle 10, 37, 49, 216, 234, 
318, 487 

cycle double cover (CDC) 
312-4, 317-8 

cycle matroid 313, 350-5, 
358, 360, 362—5, 373-6, 
406 

cycle space 313, 452, 467 

cycle-power 337-432 


deadheading 130 

de Bruijn cycle 60, 94 

de Bruijn graph 61, 63 

decision problem 494-5 

decomposition 11-2, 18, 25, 
30-1, 34, 56, 64, 76, 87-8, 
94, 140, 147, 155, 163-5, 
172-5, 248, 252, 261, 271, 
276, 280-1, 284, 286, 302, 


Subject Index 


314, 324, 371, 397-8, 
413-5, 460 
F-decomposition 397, 413-4 
decomposition procedure 324 
deficiency 121, 146 
defined on 483-4 
k-degenerate graph 269 
degree (of vertex) 6 
degree sequence 44-6, 59, 62, 
76, 94, 141, 195, 290-1, 
297-8, 345, 418, 438 
degree set 
degree-sum formula 35, 40, 
43-4, 51, 58, 214, 238, 
242, 365, 385 
deletion (G —e, G—v) 23 
deletion (matroids) 362-6 
deletion method 428-9, 
449-50 
demand 130, 184, 187 
density bound 390-1, 396 
density of graph 435-6 
dependence (linear) 400, 457 
dependence (matroids) 352, 
359, 373 
dependent edge 232 
dependent sets 313, 349-50 
depth-first search (DFS) 
156-7, 402, 404 
descendant 100 
determinant 85~—7, 92, 452— 
4, 462, 469 
diagonal Ramsey number 
385, 394, 450 
DIAMETER 495 
diameter 71—2, 75~—9, 99, 
105, 114, 122, 147, 153, 
160, 209, 216, 244, 379, 
396, 424, 432, 458, 464 
difference (of sets) 473 
digraph 53 


-Dijkstra’s Algorithm 97-100, 


105 
dilation 390 
Dilworth’s Theorem 413, 424 
k-dimensional cube Q; 35-6, 
48-9, 71, 76, 105, 108, 
119, 150, 174, 193, 282, 
296, 329-30, 379, 390 
k-dimensional simplex 395 
Dinitz Conjecture 410 
Dirac’s Theorem 218, 417-8, 
441 
direct method (of proof) 478 


Subject Index 


direct sum (matroids) 
369-70, 406 

directed graph 53, 66, 90, 
189, 377, 406, 422, 506 

DIRECTED HAMILTONIAN 
CYCLE 503 

DIRECTED HAMILTONIAN 
PATH 500, 502-3 

Directed Matrix Tree Theo- 
rem 89 

discharging 261, 304 

disconnected graph 6, 12, 
15, 21-2, 25, 31, 38-9, 
50, 52, 63, 71, 78, 85, 
149-50, 156, 165, 178, 
241, 247, 249, 333, 347, 
431-2, 437, 470 

disconnecting set 152, 155, 
159, 168 

discrepancy 402-3 

discrete system 54 

disjoint (sets) 473 

disjoint union (G + H) 39, 
48, 104, 137-8, 155, 193, 
199, 271, 306, 313, 359, 
371, 399, 419, 465, 470 

disjointness graph 13-4, 
17-8, 276 

disjunction 477 

distance 5, 46, 57-8, 70-3, 
78, 95, 97-9, 105, 130, 
137, 190, 192, 198, 201, 
209, 217, 225, 235, 246, 
265, 271, 294, 302, 345, 
379, 390-2, 400-3, 419, 
421, 449, 452, 468 

distance-preserving 400-1 

distinct 489 

DNA chains 328 

dodecahedron 243, 245, 286, 

295 

domain 437, 483-5 

dominating set 116-8, 122— 
3, 428-9, 506 

domination number (see 
dominating set) 

dot product 86, 306, 317, 
338, 400 

double cover 312—4, 317-8 

double jump 437 

double torus 267-8 

double triangle 281-2, 
285-6 

double-star 78 


double-torus 266 

doubly stochastic matrix 120 

drawing 2, 9-12, 30, 54, 
233-5, 242, 262-6, 272, 
449, 504 

dual augmentation property 
362 

dual edge 236, 238, 244-5, 
360, 363-4 

dual graph 236-9, 241-5, 
300, 309, 314-5, 317, 
360, 376 

dual matroid 349, 360-5, 
375-7 

dual problem (optimization) 
113—4, 118, 125-6, 135, 
166, 172, 179, 188, 323 

Duality Theorem 323 


ear 163-5, 172-3, 175, 248 
eccentricity 71-2, 78, 99, 105 
edge 2, 53 
edge cover 114-5, 122 
edge cut 152-5, 159-60, 
164-5, 181,190, 211, 
238, 283, 301, 303-7, 
312, 317, 452, 467 
edge-choosability 409 
edge-chromati¢ number 275, 
283 3 
edge-coloring 274-9, 282-5, 
296, 299-305, 310-1, 381, 
409 
k-edge-colorable 275, 296, 
411 
3-EDGE-COLORABILITY 
505 
k-edge-coloring 275, 284-5, 
296, 381 
A(G)-EDGE-COLORING 
2-edge-connected 164-5, 
172-3, 243, 300-2, 305, 
312—4, 317, 424 
k-edge-connected 152, 158, 
160, 164-6, 174-5, 283 
edge-connectivity 152-3, 
165-9, 274, 301-2, 406 
edge-transitive 18 
Edmonds’ Blossom Algo- 
rithm 144 
Edmonds’ Branching Theo- 
rem 405-6, 422 
eigenvalue 401, 453-70 
eigenvector 453, 455-70 
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element (of set) 471 

embedding 234-56, 266-72, 
283, 302, 313, 376, 400- 
1, 453 

empty set 472 

encoding 26, 101, 389, 397— 
8, 400-3, 494 

endpoint 2, 20, 53 

entropy 103 

equality of sets 472 

equality relation 490 

equality subgraph 126-9 

equitable edge-coloring 285 

equivalence class 9, 22, 33, 
63, 173, 313, 490-1 

equivalence relation 8-9, 22, 
63, 173-4, 490 

erasure 43, 53 

Erdés—Faber—Lovasz Conjec- 
ture 202 

Erdés—Gallai condition 141, 
148, 185 

Erdés—Szekeres Theorem 
203, 379, 382 

Euler’s Formula 233, 241-2, 
245, 255, 268, 272, 316, 
375 

EULERIAN CIRCUIT 495, 
499 

Eulerian circuit 27-34, 42, 
60-1, 64, 77, 89-91, 99, 
140, 273, 285, 298-9, 498 

Eulerian digraph 60, 64, 90- 
1, 130 . 

Eulerian graph 27-31, 34, 
60, 77, 244, 295, 298, 
308, 495 

Eulerian trail 27, 60, 64 

even cycle 109-10, 138, 174, 
204, 217, 276, 318 

even digraph 318 

even graph 27-31, 33-4, 48, 
50, 308, 311-3, 414 

even numbers 472-3 

even pair 348 

even triangle 281 

even vertex 26, 36, 100, 140 

even walk 24 

event (probability) 425-7, 
443-50 

evolution 193, 436-8 

excess (matrix) 126-30, 141, 
176-7, 179-80 

existential quantifier 475-6 
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expander 453, 463, 469 

Expansion Lemma 162, 170, 
175 

expansion operation 43—4, 
52-3, 175 

expansive property 358-60 

expectation (of random vari- 
able) 427-34, 440, 443-6, 
449, 452 

extremal problem 38-9, 41, 
116, 209, 396, 413 

extremality method 28-9, 
32, 34, 40-1, 63, 68, 137, 
249, 289, 294, 299 


face 235-50, 253-6, 267-72, 
295, 300-3, 307-9, 312-5, 
353, 360, 401, 412, 424 

face length 238-9, 241 

face-coloring 300-1, 307, 309 

factor 136 

1-factor 186-41, 145-8, 159, 
276, 283-4, 308, 310, 318 

2-factor 136, 140, 147, 276- 
7, 285, 288, 315 

f-factor 140-1, 148 

k-factor 140, 146 

1-factorable 276, 284 

1-factorization 276, 279, 
284-5, 310 

1-factorization Conjecture 
279, 284-5 

1-factorization 276, 279, 
284-5, 310 

factor-critical 147 

factorial 107, 220, 294, 386, 
428, 434-5, 486-9 

fan 170-1, 213 

Fary’s Theorem 247, 251, 
255 

fat triangle 275 

feasible flow 176-80, 184—8 

feasible solution 323, 497 

finite automaton 54 

finite graph 3 

finite set 473 

finite state machine 54, 57 

Five Color Theorem 257-8 

flat 266-8, 360 

floor function 39, 483, 491 

flow (in network) 176-89, 
495 

flow (in graph) 307-18 

flow number 309 


k-flow 307-12, 317-8 

k-flowable 309 

flower 142, 306, 317 

forbidden substructure 323, 
365 

Ford—Fulkerson Labeling Al- 
gorithm 179-82, 186-9, 
438-9 

Ford—Fulkerson Theorem 
180-5 

forest 67, 75-80, 96-7, 104, 
160, 206, 214, 217, 219, 
244, 297, 327, 345, 351, 
353—4, 362-3, 372, 413, 
424, 434, 436, 468 

Four Color Theorem 213, 
259-61, 268-70, 300-4, 
311, 314, 411, 469 

H-fragment 252-4, 256 

fraternal orientation 345 

H-free 41, 348 

P,-free 52, 202, 344, 347 

free matroid 357 

Friendship Theorem 453, 
465-7 5 

Fulkerson’s Conjecture 318 

function 483 — 

functional digraph 55, 64 

fundamental set of circuits 
374 


teleg 


Gale—Ryser Theorem 185, 
190 

Gale-Shapley Algorithm 
131-2, 135-6 

Gallai’s Theorem 376 

Gallai—-Roy—Vitaver Theorem 
196 

Gallai—-Milgram Theorem 
413 

gambler 444-5 

games 48, 51, 57, 73—4, 106, 
119-20, 183-4, 274, 286, 
366, 445 

gammoid 377 

gas—water-electricity 233 

generalized coloring 199 

generalized cover 146 

generalized partition ma- 
troid 370 

generalized Petersen graph 
316 

GENUS 266, 495 

genus 266-7, 272, 283 


Subject Index 


geometric dual 365, 376 

Gewirtz graph 466 

Ghouila-Houri’s Theorem 
420 

girth 13-4, 17, 37, 49, 
79, 105, 119, 147, 206, 
216-7, 219, 232, 245, 
255, 297, 304-6, 312-4, 
365, 396, 412, 429 

good algorithm 124-5, 142, 
196, 219, 274, 276, 279, 
292, 493-4, 504 

good characterization 495 

gossip problem 406-8, 422-3 

graceful labeling 87—8, 92-4 

Graceful Tree Conjecture 87, 
94 

graph 2 

graph transformation 64, 
138, 141, 285, 422 

graphic matroid 350, 357, 
375-6 

graphic sequence 44—5, 48, 
148, 185 

greedy algorithm 96, 116, 
195, 349, 354-7, 366, 
373-4, 429, 441-2, 496-7 

greedy coloring 194-202, 
227, 276, 324, 331-2, 
344, 442, 459 

greedy decomposition 397-8 

greedy ear decomposition 
173 

grid (P», O P,) 193, 316, 390, 
396 

grid (positions) 73, 251, 265, 
370, 410-1, 425, 446, 
460, 490 

Grinberg graph 302, 316 

Grinberg’s condition 303, 
315-6 

group 18, 309, 449, 452-3, 

growth rate 265, 431, 483 

Grétzsch graph 205-6, 215, 
218, 294 

Grétzsch’s Theorem 270 

GyArfas—Sumner Conjecture 
206, 215 


Hadwiger’s Conjecture 213 

Hajoés’ Conjecture 213, 414, 
442 

Hajés’ construction 217 


Subject.Jndex 


Hall’s Condition 110-3, 121, 
146-7, 368, 377, 463 


Hall’s Theorem 110-3, 120-1, 


146—7, 171, 175, 219, 376 
Hamiltonian closure 298, 
419, 449 
HAMILTONIAN CYCLE 
494-500, 503, 505-6 
Hamiltonian cycle/graph 
286-99, 302-4, 314— 
7, 395-6, 416~—21, 437, 
440-1, 449, 493-4, 497-9, 
502-3, 506 
HAMILTONIAN PATH 495, 
500, 502-3, 505-6 
Hamiltonian path 292, 295- 
7, 299, 303, 316-7, 428, 
497, 502 
Hamiltonian-connected 
297-8 
handle 266-8, 313 
handshake party problem 
481 
Harary graph 150, 153 
Harper’s bound 390-1 
Havel—Hakimi Theorem 45, 
52, 59 
head 53-61, 86, 90, 94, 164— 
5, 168, 178, 307-8, 357— 
8, 406, 484, 503 
head partition matroid 357 
Helly property 80, 346 
hereditary family of graphs 
226-8, 275, 325, 332, 
334, 341, 
hereditary family of sets 
349, 353, 357, 371 
hereditary system 349-55, 
357-63, 366, 369-71, 
373-4, 377 
heuristic algorithm 496 
homogeneous set 380-1 
Huffman’s Algorithm 101-3 
Huffman code 103, 106 
Hungarian Algorithm 127-9, 
132, 134-5 
hunter/farmer problem 121 
hypercube 35-6, 49, 71, 108, 
122, 150, 174, 193, 350 
hypergraph 449 
hyperplane 360-2, 375, 395 
hypobase 360-1, 375 
hypothesis 477 


icosahedron 214, 243, 315 

ideal (of sets) 349 

idempotence property 359 

if (in definitions) 473 

image 8, 55, 147, 234, 377, 
401, 483-6 

imperfect graph 232, 320-3, 
333-6, 343-4, 347 

in-neighborhood 58 

in-tree 89-91 

incidence matrix 6, 17, 56, 
86, 323, 328-9, 337, 346, 
375, 469 

incidence relation 234, 322, 
489-90 

incidence vector 338, 452 

incident 6 

inclusion-exclusion principle 
223, 230 

incorporation property 359-— 
61, 367, 374 

increasing path 406—7, 423 

increasing trail 393 

indegree 58-65, 89, 130, 
190, 331, 404, 410, 503 

independence number 113- 
4, 192, 194, 199, 319, 441 

independent dominating set 
117-8, 122-3 

independent events 426 

independent set (in graph) 
4-5, 9-10, 15, 23—5, 29, 
32, 36—7, 75, 113-8, 121—- 
2, 192-4, 199, 203, 205, 
208, 211, 215-6, 218- 
21, 226, 230, 273, 293, 
319, 333, 384-5, 393, 
395, 410-1, 413, 428-9, 
449, 493-6, 502, 506 

independent set (in matroid) 
349-77 

INDEPENDENT SET 494— 
6, 500, 502 

k-INDEPENDENT SET 495 

independent vectors 353, 
400 

index of summation 485 

indicator variable 427-34, 
448, 452 

indirect proof 151, 478 

induced by 23 

induced circuit property 
354-5, 360, 374 
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induced subgraph 23, 32— 
4, 37, 41-2, 50, 64, 75, 
175, 204, 211, 219, 225- 
6, 231, 281, 285-6, 315, 
319-21, 324, 330-4, 340- 
1, 348, 345, 410, 434, 
450, 454, 458-9, 470 

induction 19-21, 24~-34, 40-— 
7, 479-83 

induction hypothesis 19, 
479-82 

induction parameter 19, 42, 
480 

induction step 19, 479-82 

induction trap 42—4, 68, 
481-2 

infinite graph 3 

infinite set 473 

integer 471, 474 

integer lattice 393, 474 

integer linear program 323 

integrality condition 465, 
470 

Integrality Theorem 181, 
183 

Interlacing Theorem 458 

internal vertex 20-1, 69, 
72, 151, 161, 163, 166-7, 
173, 177, 270, 412, 415 

internally disjoint paths 
158, 161-2, 166-75, 182— 
3, 212, 218, 274, 417 

intersection graph 324, 327-— 
8, 341, 344-5, 451 

intersection number 397 

intersection of matroids 366 

intersection of sets 473 

intersection representation 
324, 345, 397 

t-interval 451 

interval graph 195-6, 204, 
224, 226-7, 231, 328, 
330, 346-7 

interval number 451 

interval representation 195— 
6, 226, 328-9, 346 

intractable 495 

inverse 18, 53, 267, 390, 484 

inversion 33 

involution 470 

isolated vertices 22, 31-2, 
51, 60, 77, 80, 90, 96, 
114-8, 121-2, 138, 155, 
210, 223, 230, 376-7, 
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398-9, 408, 414, 422-3, 
433-4, 437, 451, 455 
isometric embedding 400-1 

isomorphic to 7 
isomorphism 7-17, 38, 49, 
56, 75, 78, 81, 92, 94, 

207, 234, 243, 276, 364, 
430, 438-9, 441, 453, 
485, 490 
isomorphism class 9, 12-3, 
81, 207, 234 
isomorphism relation 8-9 


join G v H 138, 146, 150, 
155, 193, 199, 210, 215- 
6, 236, 264, 271, 291, 
298, 310, 334, 360, 380, 
387, 436-7 

joined 22 

Jordan Curve Theorem 235, 
238, 241, 258, 301 

junk 328, 340 


Kz; 10-2, 26, 138, 155, 220- 
1, 240, 286, 344, 384, 
386-7, 395, 467 

K4 11-2, 25, 31, 48-4, 53, 
175, 209, 212-3, 215, 
218, 236, 240, 250, 256, 
272, 302, 314, 349, 352, 
357, 374, 401 

Ks 9-12, 140, 214, 234, 242- 
3, 246-7, 250-2, 256, 
258, 263, 267, 269, 283, 
363, 365 

K3.3 10, 43, 150, 159, 233-4, 
242-3, 246-7, 250-2, 256, 
267, 269, 272, 363, 365, 
422 

Kempe chain 258-60 

kernel 57-8, 64, 410-1 

kernel-perfect 410-1 

king 62-3, 65-6, 190, 450 

kite 12, 23, 50, 84-6, 92, 
223-4, 279-81, 349, 397 

knight’s tour 295 

Kotzig’s Theorem 284 

Krausz decomposition 286 

Kruskal’s Algorithm 95-7, 
104, 498 

Kuratowski subgraph 247— 
52, 255 

Kuratowski’s Theorem 246— 
8, 251-2, 255-6, 269, 364 


Konig’s Other Theorem 376 

Konig—Egervary Theorem 
113-5, 121, 123, 146, 
168, 174, 189, 210—1, 
227, 368, 373, 413, 424 

Koénigsberg Bridge Problem 
1-2, 19-20, 26 


Lagrangian multiplier 456 

Laplacian matrix 463, 469 

Laplacian eigenvalue 469 

Las Vergnas’ condition 298, 
418 

lattice 349, 360, 393, 474 

leaf 67-73, 76, 80-3, 86, 89, 
93, 101-3, 106, 115, 156, 
174, 198, 214, 219-20, 
324-5, 331, 468 

leaf block 156, 198 

left child 101 

left subtree 101 

length (of object in graph) 
20,237. 

length (of encoding of graph) 
397, 398, 401 

lexicographic ‘product 393 

lg 97, 202, 400, 422-3, 434, 
449,451 <= 

line digraph 168 

line graph 168, 227, 273-5, 
279-86, 295, 320, 409, 
422, 493, 505-6 

linear matroid 351, 353, 
357, 360 

linear programming 179, 
376 

linearity of expectation 427- 
8, 432 

list 474 

list k-colorable 408 

list chromatic index 409 

list chromatic number 408- 
12 

list coloring 199, 408—12, 
423 

list edge-coloring 409-11 

literal (in logical formula) 
499-501, 506 

S-lobe 211-3, 218, 247~—8 

local density 390, 396 

local search 497 

logical formula 499-500 

longest cycle 173, 292, 294, 
298, 416 
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longest path 34, 71, 147, 
196-7, 228, 294, 416-9 
loop (in graph) 2,6, 8, 20, 24, 
27, 34-5, 44, 69, 76-7, 

84—5, 107, 149, 192, 223, 
236-9, 241, 267-8, 275, 
284, 286, 294, 300, 305 

loop (in digraph) 54, 58-9, 
61, 64, 299 

loop (in matroid) 351, 366-7, 
370, 372-3, 375 

loopless digraph 56, 62, 64, 
66, 89, 302, 393, 413, 420 

loopless graph 6, 17, 34-5, 
40, 44, 49-52, 75, 85-6, 
155, 192, 203, 265, 275, 
285, 302 


Mader’s Theorem 175 

magnifier graph 463—4, 469 

map (in the plane) 1, 5-8, 
191, 219, 233-8, 258-60 

Markov chain 55 

Markov’s Inequality 432-3, 
444, 448, 452 

Marriage Theorem 111 

martingale 443-7, 452 

martingale tail inequality 
443-7, 452 

MATCHING 495, 506 

matching 100, 107-48, 150, 
166, 168, 175, 179-80, 
211, 216, 227, 273-6, 
283, 295-6, 308, 310, 
318, 349, 352, 357, 366, 
368-9, 373-7, 395, 399, 
408, 411, 419, 424, 436, 
449, 451-2, 463, 468-9, 
493, 499, 505-6 

mates 107, 131, 337-40 

matrix 490 

matrix rounding 186, 190 

Matrix Tree Theorem 85-6, 
89, 92-4, 453, 462-3, 469 

0,1-matrix 120, 322, 328, 
454 

matroid 74, 313, 349-78, 
406, 506 

matroid basis graph 376 

Matroid Covering Theorem 
372 

Matroid Intersection Theo- 
rem 366-71, 376-7, 413 
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3-MATROID INTERSEC- 
TION 506 

Matroid Packing Theorem 
372 

Matroid Partition Problem 
378 

Matroid Union Theorem 
‘366, 369-72, 377-8 

Max-flow Min-cut Theorem 
180-5 

maximal 29 

maximal clique 37, 231, 281, 
323, 327, 329-31, 345 

maximal forest 351 

maximal matching 108, 118, 
122 

maximal outerplanar graph 
243, 256 

maximal path 27-9, 31, 34, 
60, 163, 204, 293, 298 

maximal planar graph 242— 
3, 245, 271 

maximal trail 29, 31, 33, 64 

maximum 29 

maximum clique 215, 322, 
333, 336—40, 342-3, 347— 
8, 439 

maximum degree 29, 34, 41, 
47-8, 52, 67, 75-9, 114, 
200, 202, 204, 251, 256, 
284-5, 291, 345, 390, 
439, 463, 476 

maximum density 435 

maximum flow 1, 176-83, 
186-8 

MAXIMUM MATCHING 
495 

maximum matching 100, 
108-29, 132—4, 139, 141- 
2, 145, 147, 349, 493 

maximum stable set 321, 
323, 336-40, 343-4, 348 

maximum independent set 
29, 31, 114-5, 121-2, 
203, 356, 496 

maximum weighted match- 
ing 125-30 

median 78 

member (of set) 471 

Menger’s Theorem 167-75, 
181-3, 189, 227, 274, 
368, 377, 404—6, 422, 495 

method of contradiction 478 

Meyniel graph 330—1, 347-8 


Meyniel’s Theorem 420, 424 

Min-cost Flow Problem 185 

min-max relation 113, 138, 
274, 320, 323, 366, 368, 
371, 495 

minimal imperfect graph 
320, 322, 333-4, 347 

minimal nonplanar graph 
247-8 

minimal vertex separator 
231, 345 

minimally k-connected 175 

minimally k-edge-connected 
175 

minimax 104 

minimum cut 167, 179, 182-— 
3, 189 

minimum degree 34, 49, 51, 
70, 79, 116-7, 122, 152- 
3, 159, 202, 213-4, 218, 
243, 245, 256, 261, 285, 
288-9, 293, 296-8, 343, 
386, 428-9, 440, 457, 496 

minimum polynomial 457-8, 
461 os 

MINIMUM SPANNING CY- 
CLE 494 > 

minimum spanning tree 
(MST) 95, 104, 349, 496, 
498 3 

MINIMUM VERTEX 
COVER 496 

minimum weighted cover 
126-9 

minor (of graph) 251, 256, 
269 

minor (of matroid) 363, 365, 
375, 462 

Minty’s Theorem 203 

Model A 430~5, 438, 446-7 

Model B 430, 433-4, 436 

modular 3-orientation 317 

modulus 490 

monochromatic 386-7, 393— 
5, 449-50 

monotone property 433 

monotone subsequence 203, 
390 

monotone tournament 393 

mountain range 48 

multigraph xiv 

multinomial coefficient 489 

multiple edges 2, 6, 44, 52, 
54, 59, 76, 84-5, 111, 
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168, 182, 185, 192, 213, 
221-3, 236-9, 273-6, 279, 
286, 298, 317, 351, 405-6 

multiplicity (of edge) 166, 
265, 275, 279, 395, 453, 
455, 460-1, 466, 468-70 

Mycielski’s construction 
205-6, 215, 258 


Nash-Williams’ Orientation 
Theorem 175 

natural number 471 

nearest-insertion algorithm 
497, 505 

nearest-neighbor algorithm 
496-7 

necessary condition 24 

necklace 173 

negation 477 

neighborhood 34, 45, 58, 
116, 121, 162, 215-6, 
224, 325, 341, 346, 348, 
368, 390, 438, 463, 468 

neighbor 2 

net demand 184 

network 1, 149, 161, 165, 
176-90, 463, 495 

network flow 176, 182, 184; 
186, 189, 495 

node 55, 101, 176~—88, 388— 
9, 391, 449 

NODUP scheme 423 

NOHO property 407—8, 423 

nondeterministic 494—5 

nonplanar graph 243, 247-8, 
252, 269, 365 

nonseparating 251 

nontrivial graph 22 

Nordhaus—Gaddum Theorem 

nowhere-zero flow 307-8 

NP 369, 390, 439-41, 495-7, 
499-506 

NP-complete 369, 440, 495, 
497, 499-506 

NP-hard 390, 439, 495, 499— 
500, 502, 506 

null graph 3, 435 


O(f) 94, 106, 124, 228, 
387-8, 437, 494 

o-triangulated 330, 347 

obstruction 269, 278, 331-2 

obstruction-free ordering 
331-2 

odd antihole 340, 343 
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odd component 136-9, 147 

odd cycle 24-8, 32, 41, 49, 
57-8, 63~7, 112, 122, 
138, 142, 174, 192, 195— 
7, 199-200, 203-4, 239, 
276, 285, 320, 330, 334, 
339-44, 347, 357, 410, 
455, 472, 475-8 

odd degree 15, 30, 34-5, 
43, 47, 77, 100, 385, 389, 
414, 498-9 

odd hole 340-1, 343 

odd number 473 

odd triangle 281 

odd vertex 30, 36, 100 

odd walk 24-5, 57-8 

one-to-one correspondence 7, 
37, 50, 81, 473, 484 

One-Way Street Problem 
130, 165, 422 

open set 235 

optimal coloring 192, 194, 
197, 199, 202, 207, 215, 
217, 227, 324, 326, 331, 
336, 340, 344, 348 

optimization problem 39, 
113-4, 179, 322, 396 

order (of a graph) 35 

order (of recurrence) 483 

order-preserving property 
358-60 

ordered graph 406, 423 

ordered pair 8, 21, 53-4, 
56, 61, 93, 190, 294, 299, 
309, 393, 420, 440, 474, 
489-90 

Ore’s condition 297 

Ore’s Theorem 417-8, 420, 
424 

orientable cycle double cover 
318 

orientation 62-5, 86, 89, 94, 
147, 165-6, 174-5, 196-— 
7, 203, 228-32, 244, 293, 
307-8, 311, 317, 329-32, 
345, 376, 379, 393, 410- 
3, 424, 449, 484 

oriented graph 62 

orthonormal 456-7, 461 

out-neighborhood 58 

out-tree 89-91, 98 

outdegree 58-66, 89, 130, 
190, 299, 410, 449, 503 

outer face 235, 240, 412 


outerplanar graph 239-40, 
243, 256, 269-71 

outerplane graph 239-40, 
244-5 

Overfull Conjecture 278-9, 
285 

overfull subgraph 278-9, 
284-5 


P3 11, 32, 48, 64, 163, 173, 
199, 223, 333, 386, 395, 
417-8, 465 

Py 11-2, 15-8, 23, 33-4, 50, 
52, 108, 138, 147, 163, 
202, 221, 344, 347, 417 

Ps 12, 23, 231, 404 

p-critical 334-6, 339-44, 348 

P=NP? 441, 495, 497 

pair-disjoint 447-8 

pairwise 489 

parallel elements (in ma- 
troid) 351-2, 373, 375 

parallel computation (see 
nondeterministic) 


Ps 


parent 100-1,:147, 157, 220, 
402 4 

parity 17, 27,236, 46, 137-9, 
142, 148, 236, 239, 271- 
2, 301-2, 310, 312, 317, 
347, 388, 473, 490 

parity graph 330 

parity lemma 148 

parity subgraph 312, 317 

partial transversal 127-8, 
353 

partite set 4 

k-partite 5, 51, 192, 207, 296 

partition 473 

partition matroid 357-8, 
366, 368, 370-1, 373 

partitionable graph 335-42, 
347-8 

Pascal’s Formula 73, 488 

path 5-34, 38, 43, 47, 55- 
64, 67-81, 84, 88-106, 
109-12, 119, 123-4, 129- 
34, 142-5, 147, 151-83 

u, v-path 20 

X, Y-path 166, 170, 175 

paw 12, 31, 236, 279-81 

pendant vertex 67 

perfect graph 204, 224, 226- 
8, 319-24, 328, 330-7, 
341, 343-4, 347, 413 
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a-perfect 319-22 

B-perfect 335 

y-perfect 319-22 

perfect elimination ordering 
224 

Perfect Graph Theorem 
(PGT) 226-7, 320-2, 
334-5, 344, 413 

perfect matching 107-8, 
111-4, 118-22, 125-9, 
131, 134-6, 139, 141, 
146, 148, 211, 274-6, 
283, 295, 318, 374, 424, 
451-2, 469 

perfect order 331-2. 

perfectly orderable graph 
331, 347 

performance ratio 202, 496, 
498-9, 505 

permutation 8, 14, 18, 32-3, 
55, 64, 107, 120, 332, 390, 
448-9, 453-—4, 470, 486 

permutation matrix 120, 470 

Petersen graph 13-8, 37, 41, 
50, 71, 79, 87, 108, 119, 
122, 139, 159, 175, 192, 
197, 203, 230, 245, 251, 
255, 269, 276, 279, 283— 
4, 288, 292, 295-7, 304- 
18, 470; 487 

PGT (see Perfect Graph The- 
orem) 

pigeonhole principle 151, 
171, 230, 261, 378-93, 
491-2 

pigeonhole property 427 

planar graph 5, 234-62, 
266, 269-72, 274, 301-— 
4, 307, 309, 312, 315-6, 
341, 349, 358-65, 376, 
411-2, 423-4, 469, 504-6 

PLANAR 3-COLORABILITY 

planar embedding 235-6, 
241-8, 251-4, 271, 376 

planar map 238 

PLANARITY 252, 495 

plane graph 235-45, 254-6, 
270, 300-3, 307-9, 312, 
314, 360, 363-5, 375, 412 

planted tree 101 

Platonic solid 242 

Poisson distribution 434 

polygon 242, 247-8, 255-6, 
270-1, 452 
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polygonal curve 48, 234-5, 
245 

polyhedron 242-3 

polynomial-time algorithm 
124-5, 253, 269, 282, 
377, 438-9, 493-500, 
504-5 

positional game 120 

positive k-flow 307, 318 

kth power (of graph) 296 

predecessor 54, 58, 62, 98, 
294, 417 

predecessor set 58 

prefix-free code 101-3, 106 

pretzel 266 

Prim’s Algorithm 97, 104 

prime snark 305 

principal submatrix 454, 458 

probabilistic analysis 425 

probabilistic method 117, 
206, 385, 410, 425-52 

probability model 425-7, 
430 


probability space 426-7, 
430, 436, 443, 445 

product dimension 397-9, 
422 

product representation 
398-9 

proper 191, 275, 300, 388, 
472 

proper coloring 191-2, 196- 
201, 204-5, 217-20, 223, 
227, 300, 321, 408, 410, 
412, 423-4, 449, 483 

proper edge-coloring 275-7, 
285, 381, 409 

proper face-coloring 300 

proper interval graph 347 

proper labeling 388-9, 391, 
395 

proper subgraph 192, 212, 
247, 472 

proper subset 120, 152, 155, 
356, 464, 472 

Proposal Algorithm 131, 
134-5, 411 

Priifer code 81-3, 92, 345 


quadratic growth 483 
quota 380-5 


radius 71—2, 75, 78, 265 
Ramsey multiplicity 395 


Ramsey number 206, 380-8, 
394, 426, 428, 437, 450 
Ramsey number (for graphs) 
386 

Ramsey’s Theorem 378, 
380-6, 388, 393 

random graph 196, 425-6, 
430, 482, 436-40, 445-7, 
450, 463 

random variable 427-33, 
442-3, 445-7, 452, 469 

rank function (of matroid) 
349-50, 354-61, 364, 
366-77, 406 

rank (of matrix) 453-5 

rational numbers 471-2 

real numbers 120, 129, 203, 
471-2 

reciprocity 229 

Reconstruction Conjecture 
38 

recurrence 84—5, 94, 106, 
221-3, 228-30, 232, 272, 
468, 483 = 

reduces to 499, 506 

reducible configuration 258- 
61, 265, 270 

reduction from 499-500, 
505-6 Eb 

reflexive property (relations) 
490 

region 1, 5, 191, 233, 235, 
238-9, 245, 247, 255, 
258, 268, 316, 391, 474, 
479-80 

regular graph 34-44, 48-53, 
79, 87, 92, 116, 122, 136, 
139-40, 147, 153, 175, 
190, 198, 201, 204, 242, 
276, 283-5, 295-305, 308, 
314, 317, 385, 387, 434, 
453, 460-6, 470, 505-6 

3-regular 37, 40, 43-4, 49, 
52-3, 92, 122, 136, 139, 
146-7, 153, 158-9, 173, 
175, 243-5, 271-2, 276, 
292, 295-6, 300-5, 308, 
314-8, 385, 424, 505-6 

k-regular 34-6, 49, 51, 79, 
111, 116, 136, 140, 146— 
7, 151, 159, 198, 276, 
284-5, 296, 411, 428, 
460-2, 464, 466, 469 

regular embedding 272 


| 


585 


relation 8, 489 

remainder classes 491 

Replacement Lemma 334 

representable matroid 351 

restriction matroid 363-4, 
370, 375-7 

restriction martingale 445-7 

reverse edge 66 

right child 101 

right subtree 101 

ring (in planar configura- 
tion) 258-60, 270 

Ringel’s Conjecture 87 

road network 5—6, 99, 112, 
165 

Robbins’ Theorem 166 

root 67, 89-90, 94, 100-1, 
106, 147, 157, 198, 220, 
229, 345-6, 402, 406, 
433, 449, 453, 455, 466 

rootable family 345 

rooted plane tree 101, 106 

rooted tree 100, 106, 404 

rule of product 486 

rule of sum 485 

running time 97, 124-5, 
132, 425, 430, 494 


2-SAT 500, 505 

3-SAT 500-1, 505-6 

SATISFIABILITY 499-500, 
506 

satisfiable formula 499-500, 
506 

saturated vertex 107, 110, 
118, 124, 133, 139, 142- 
4, 352, 377 

Schur’s Theorem 393 

score sequence 62 

SDR (see System of Distinct 
Representatives) 

Second Moment Method 
433-4, 487, 440-3, 450 

selection (subset) 486 

self-complementary 11-2, 
17, 32, 245, 271, 320 

Semi-strong Perfect Graph 
Theorem 344 

separating set 149-50, 153, 
158, 162, 164, 169, 183, 
200, 218, 231, 251 

separator 166, 231, 345 

sequence 483 

set 471 
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r-set 380-3 

Shannon bound 103, 106, 
275, 279, 285 

Shannon Switching Game 
365-6 

sharp 39, 70, 117, 123, 159, 
210, 216, 269, 279, 284, 
298, 339, 399, 434, 450 

sharp threshold 434 

shift graph 202 

SHORTEST CYCLE 495 

shortest cycle 13, 217 

shortest path 29, 34, 73, 76— 
7, 97-8, 100, 400 

simple curve 235 

simple digraph 54-5, 59, 
61-4 

simple graph 2 

simple hereditary system 
(matroid) 351, 375 

simple polygon 255, 270 

simplicial construction or- 
dering 325-6 

simplicial elimination order- 
ing 224-7, 231, 324-7, 
344 

simplicial subdivision 388-9, 
391, 395 

simplicial vertex 224-7, 231, 
325-7, 331, 422 

sink 1, 176-89, 373, 449, 
463 

sink set 178 

sink vertex 176 

size 35, 473 

size of decomposition 414 

size of matching 114 

skew partition 347 

snark 305-7, 312, 314, 317 

f-soluble 148 

source 1, 176-89, 266, 373, 
413-4, 463 

source set 178, 413 

source vertex 176 

source/sink cut 178-80, 
188-9 

span function (of matroid) 
358-60, 375 

spanning cycle 231, 240, 
252, 273-4, 276, 284, 
286-98, 303-4, 314, 317, 
376, 421, 437, 493-4, 
497-9, 505 


spanning path 94, 104, 200, 
287, 292, 498, 502 

spanning set (of matroid) 
360-1, 376—7 

spanning subgraph 67, 95, 
136, 140, 160, 223, 243, 
312, 343, 351, 353, 373, 
399, 454, 459 

spanning tree 67-70, 73-87, 
92-8, 103-5, 123, 147, 
157-8, 160, 174, 190, 
198, 216, 221, 232, 244, 
312, 327-8, 349, 351, 
354, 360, 363, 365, 372, 
377-9, 402-6, 424, 451, 
462-3, 469, 483, 496, 
498, 505 

spans (in matroid) 358, 365 

sparse graph 145, 437, 440 

Spectral Theorem 456-8 

spectrum 453, 455, 462, 
468-70 

Sperner’s Lemma 378, 388- 
91,395 = 

SPGC (see Strong Perfect 
Graph Conjecture) 

spine of caterpillar 88 

split graph 345 

split of digraph 59, 424 

squashed- cube dimension 
397, 401, 403, 422, 468 

stable matching 131-2, 134— 
6, 411 

Stable Roommates Problem 
135 

stable set 4, 319-23, 326, 
330-48, 372, 441, 447-8, 
506 

standard deviation 433 

star 67, 71-2, 76, 78, 80-1, 
88, 115-6, 121, 214, 275, 
333-4, 344, 413, 459-60 

star-cutset 333-4, 344 

Star-Cutset Lemma 333-4, 
347 

r-staset 447-8 

Steinitz exchange property 
358-60 

stem (of blossom) 142-3 

Stirling’s approximation 440 

straight-line embedding 251, 
255-6 

Street-Sweeping Problem 
129 
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strength of theorem 440 

strict digraph 294, 299, 420 

strict gammoid 377 

strong absorption property 
355-6 - 

strong component 56-7, 63- 
4, 156, 160 

strong digraph 58, 65, 90, 
165, 420 

strong duality 179-80, 323 

strong elimination property 
359, 374 

Strong Embedding Conjec- 
ture 313 

strong induction 19, 66, 79, 
480 

strong orientation 165 

Strong Perfect Graph Con- 
jecture (SPGC) 320, 334— 
7, 339-44, 347-8 

strongly connected digraph 
56, 60-1, 63, 65, 89, 164, 
245, 420, 450 

strongly perfect graph 330- 
1, 347 

strongly regular graph 464— 

, 470 

subconstituent 470 

subcube 36, 49, 295, 401 

subdivision 162-3, 173, 212— 
5, 218-9, 246-51, 256, 
269, 272, 304-5, 310- 
1, 314, 365, 388-9, 391, 
395, 442 

H-subdivision 212, 213, 218 

subgraph 6, 56 

submodular function 373 

submodularity property 
354-6, 367, 370-4, 377 

subset 471 

subtree 80-3, 86-7, 93, 101, 
106, 157, 324—7, 344-5, 
436, 449 

subtree representation 324— 
5, 327 

successor 54, 57-8, 60-2, 
190, 294, 345, 410-1, 
421, 451 

successor set 58 

sufficient condition 24 

sum of graphs 39 (see dis- 
Joint union) 

sum of matroids 369-70, 
406 
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summand 485 

supbase (of matroid) 360-1, 
363 

superconcentrator 463 

supergraph 297 

superregular graph 470 

supply 130, 184, 187 

sweep subgraph 130 

2-switch 46~—7, 53 

absorption property 351, 
357, 377 

Sylvester’s Law of Inertia 
457, 459 

symmetric difference 109-— 
10, 118-19, 122, 133, 
137-8, 160, 314, 348, 
352-3, 359, 374, 376, 
467, 473-4 

symmetric digraph 175, 502 

symmetric matrix 6, 456-8, 
469 

symmetric property (rela- 
tions) 490 

system of distinct represen- 
tatives 119, 171, 369 

Szekeres—Wilf number 231 

Szekeres—Wilf Theorem 201 

Szemerédi Regularity 
Lemma 388 


tail (of edge) 53-60, 86, 91, 
164—5, 168, 178, 307-8, 
357-8, 484, 503 

tail partition matroid 358 

Tait coloring 301-2, 314 

Tait’s Conjecture 302, 304 

Tait’s Theorem 307-9, 311, 
314 

target (of function) 483 

Tarry’s Algorithm 95 

telegraph problem 423 

telephone problem 422 

tensor product 201 

ternary matroid 357 

thickness 261, 271 

threshold (in Ramsey theory) 
380-1, 387 

threshold function 425, 433- 
7, 440-1, 450-1 

Tic-Tac-Toe 120 

tolerance (of path) 177--80 

TONCAS 28-9, 44, 110, 136, 
184, 225, 246 

toroidal 266—8, 272, 341 


torus 266-9, 272, 317 

total coloring 411, 423 

Total Coloring Conjecture 
411 

total dominating set 117, 
122 

totally unimodular 469 

t-tough 288 

toughness 288, 292, 297 

tournament 62-6, 190, 200, 
293, 299, 329, 393, 413, 
428, 450-1 

trace 453-4 

traffic lights 201, 266, 328 

trail 20, 26-34, 60, 64, 77,. 
90, 100, 106. 173, 295, 
313, 380, 393, 506 

u, v-trail 20, 34 

transformation 47, 59, 64, 
135, 138, 141, 168, 171, 
182-3, 186, 189, 285, 
292, 360, 422, 494, 499- 
502 ‘ 

transformation from 499 

transitive digraph 228, 413, 
424 eS 

transitive graph 14, 18 

transitive orientation 228, 
231, 331, 413 

transitive property (rela- 
tions) 490 

transitivity of dependence 
(matroids) 359 

transportation transporta- 
tion network 184-5 

Transportation Problem 130, 
185 

transposition 454 

transversal matroid 352--3. 
357, 368-9, 373, 376—7 

transversal (of matrix) 126- 
8, 135 

Traveling Salesman Problem 
(TSP) 452, 493-4, 496-8, 
505 

tree 67-106, 118, 122--3, 
146-7, 155-8, 174, 190, 
198, 202, 204, 214, 216, 
219-21, 224, 229, 244-5, 
296, 312, 315, 317, 323- 
4, 327-8, 344-6, 349— 
51, 354, 360, 363. 365. 
372, 377-9, 386, 390, 
393-4, 396, 402-7, 424, 
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436, 449, 451, 455, 462- 
3, 467-9, 492, 498-9, 505 

k-tree 345 

triangle 12, 18, 41-2, 48~ 
52, 164, 223, 230-1, 256, 
275, 279-3, 285-6, 296, 
305, 336, 349, 397, 399, 
409, 412, 422, 424, 454, 
465, 469 

triangle-free graph 41- 
2, 50-1, 159, 193, 203, 
205-6, 208-9, 216, 241- 
2, 261-3, 270, 397, 399, 
424, 469, 476 

triangle inequality 66, 78, 
498, 505 

triangle, monochromatic 
378-9, 383-9, 391, 394-5, 
449 

triangular grid 390-2 

o-triangulated graph 

triangulation 242, 245, 258— 
61, 270, 300, 307, 315, 
378, 388-9, 424 

trivial 22 

trivial edge cut 304-6 

Tucker’s Algorithm 34 

k-tuple 15, 26, 32-3, 35-6, 
49, 76, 379, 400, 474, 486 

Turan graph 207-10, 216 

Turan’s Theorem 209-10, 
216-7 

Tutte graph 303 

Tutte’s 1-factor Theorem 
146, 283 

Tutte’s Condition 136-7, 
139, 141, 146-7 

Tutte’s Conjectures 

Tutte’s Theorem 139, 146-8, 
250 

twin 348 

two-step method 428 


unavoidable set 258, 260—1 

underlying graph 56, 60, 66, 
89,175, 177 

k-uniform hypergraph 449 

uniform matroid 357, 370, 
373, 376, 406 

uniformity property 354-6, 
359, 361, 374 

union of graphs 25 

union of digraphs 56 

union of matroids 369-78 
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union of sets 473 
unipathic digraph 66 
unit interval graph 346 
unit-distance graph 201 
universal quantifier 475-6 
universe 223, 472-6 
unlabeled graph 9, 38 
unsaturated (vertex/edge) 
107, 109-11, 115, 123, 
129, 132, 134, 139, 142— 
5, 147, 368, 377 
unstable pair 130 


value of flow 176 

variance 433 

vector space 349, 351, 355, 
452-3, 467, 470 

vectorial matroid 351-2, 
355, 373 

Venn diagram 474 

vertex 2 

n-vertex graph 34 

vertex k-split 174 

VERTEX COVER 496, 502- 
3, 506 

vertex cover 112-8, 121, 


123-9, 146, 168, 227, 349, 


368, 413, 459, 502-3, 506 


vertex cut 149-53, 164, 218, 
248, 333, 376 

vertex duplication 321-2, 
348 

vertex multiplication 320, 
322 

vertex ordering 6, 55, 194- 
202, 298, 331-2, 428, 451 

vertex separator 231, 345 

vertex set 2 

vertex split 

vertex-color-critical 218 

vertex-deleted subgraphs 
37-8 

vertex-transitive 14, 18 

Vizing’s Theorem 275, 284— 
5, 399, 409 


Wagner’s Theorem 269 

walk 20-2, 24-5, 31-3, 48, 
57-8, 60, 63, 65, 99, 203, 
236-9, 392, 455, 458, 461 

weak absorption property 
351, 354, 356, 374, 377 

weak elimination property 
352, 353, 359, 373-5 

weak dual 244 

weak duality 323, 367, 376 
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weak elimination property 
352-3, 355-6, 359, 373-5 

weakly chordal graph 330-1, 
334, 347 

weakly connected 56, 60 

weighted average 389, 427 

weighted cover 125-9 

weighted graph 95-8, 103-6, 
134, 190, 372, 377, 494, 
498, 506 

weighted intersection graph 
327 

weighted matching 125-6, 

_ 145, 366 

Well Ordering Property 19, 
479 

wheel (Ki V Cy-1) 174, 229 

Whitney’s Theorem 166 

Wiener index 72 

winning strategy 57, 74, 
119, 366 

Woodall’s Theorem 420, 424 

word form of permutation 
101, 486 


zero flow 176, 180-1, 184, 
187, 307-8 


continued from inside front cover 


1(D) 
I(F) 
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Nlv] 


N*(v), N7(v) 


n(G) 


O(f), off) 
O(H) 


Sy 
Spec(G) 
Ar 


maximum length of path 
length of a face 
logarithm base 2 
natural logarithm 
matching 

incidence matrix 

cycle matroid of G 

dual hereditary system 
contraction of M to F 
restriction of M to F 

set of natural numbers 
network 

(open) neighborhood 
closed neighborhood 
out-, in-neighborhood 
order (number of vertices) 
growth rate 

number of odd components 
probability of an event 
path with n vertices 
product dimension 
squashed-cube dimension 
k-dimensional hypercube 
radius 

Ramsey number 

graph Ramsey number 
set of real numbers 
RxR 

rank function of matroid 
surface with y handles 
spectrum (eigenvalues) 
transpose of matrix 

tree, tournament 

Turan graph 

size of Turan graph 
uniform matroid 

upper bound on flow 
value of a flow f 

vertex set 

wheel with n vertices 
weight of edge 

set of integers 

integers modulo p 
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a(G) 
a'(G) 
B(G) 
B'(G) 
y(G) 
A(G) 


A*(G), A-(G) 


5(G) 


8*(G), 8" (G) 


d(v) 
€g(u) 
O(f) 
6(G) 
6'(G) 
K(G) 
K'(G) 
K(x, y) 
K'(x, y) 
K(r: G) 
A(x, ¥) 
A(x, y) 
Career & 
Ly, +++, Mn 
u(ey, U(G) 
v(G) 

IT 

p(G) 

» 
Oo,m,T 
a(v) 

Om 

t(G) 
Y(G) 
@(G; ) 
x(G) 
x'(G) 
x(G; k) 
x(G) 
w(G; d) 
Q(f), o(f) 
w(G) 
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independence number 
maximum size of matching 
vertex cover number 
edge cover number 
genus, domination number 
maximum degree 
maximum out-, in-degree 
minimum degree 
minimum out-, in-degree 
demand at a vertex 
eccentricity of vu in G 
growth rate 

clique cover number 
intersection number 
(vertex) connectivity 
edge-connectivity 

local connectivity 

local edge-connectivity 
local-global connectivity 
max # disjoint paths 

max # edge-disjoint paths 
eigenvalues 

eigenvalues 

edge multiplicity 
crossing number 

product 

maximum density 
summation 

permutation 

supply at a vertex 

span function 

number of spanning trees 
arboricity 

characteristic polynomial 
chromatic number 
edge-chromatic number 
chromatic polynomial 

list chromatic number 
minimum polynomial 
growth rate 

clique number 


